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CACNA1G POLYNUCLEOTIDE , POLYPEPTIDE and METHODS OF USE 
THEREFOR 

STATEMENT AS TO FEDERALLY SPONSORED RESEARCH 

5 

This invention was made with Government support under Grant No. CA433 18 
and CA54396, awarded by the National Cancer Institute and Grant No. CA43318, a 
Colon Cancer Spore Grant. The government may have certain rights in the invention. 

10 FIELD OF THE INVENTION 

The present invention relates generally to the regulation of gene expression 
and more specifically to a method of determining the DNA methylation status of CpG 
sites in a given locus and correlating the methylation status with the presence of a cell 
15 proliferative disorder. 

BACKGROUND OF THE INVENTION 

DNA methylases transfer methyl groups from the universal methyl donor S- 
20 adenosyl methionine to specific sites on the DNA. Several biological functions have 
been attributed to the methylated bases in DNA. The most established biological 
function for methylated DNA is the protection of DNA from digestion by cognate 
restriction enzymes. The restriction modification phenomenon has, so far, been 
observed only in bacteria. Mammalian cells, however, possess a different methylase 
25 that exclusively methy-lates cytosine residues that are 5' neighbors of guanine (CpG). 
This modification of cytosine residues has important regulatory effects on gene 
expression, especially when involving CpG rich areas, known as CpG islands, located 
in the promoter regions of many genes. 



30 



Methylation has been shown by several lines of evidence to play a role in gene 
activity, cell differentiation, tumorigenesis, X-chromosome inactivation, genomic 
imprinting and other major biological processes (Razin, A., H., and Riggs, R.D. eds. 
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in DNA Methylation Biochemis-try and Biological Significance . Springer- Verlag, 
New York, 1984). In eukaryotic cells, methylation of cytosine residues that are 
immediately 5' to a guanosine, occurs predominantly in CG poor regions (Bird, A., 
Nature, 221:209, 1986). In contrast, CpG islands remain unmethylated in normal 
5 cells, except during X-chromosome inactivation (Migeon, et al., supra) and parental 
specific imprinting (Li, et al., Nature, 366 :362. 1993) where methylation of 5' 
regulatory regions can lead to transcriptional repression. De novo methylation of the 
Rb gene has been demonstrated in a small fraction of retinoblastomas (Sakai, et al., 
Am. J. Hum. Genet, 48:880, 1991), and recently, a more detailed analysis of the VHL 

10 gene showed aberrant methylation in a subset of sporadic renal cell carcinomas 

(Herman, et al., Proc. Natl. Acad. Sci., U.S.A., 91:9700, 1994). Expression of a tumor 
suppressor gene can also be abolished by de novo DNA methylation of a normally 
unmethylated CpG island (Issa, et al., Nature Genet., 7:536, 1994; Herman, et al., 
supra; Merlo, et al., Nature Med., 1:686, 1995; Herman, et al., Cancer Res., 5f>:722, 

15 1996; Graff, et al., Cancer Res., 55:5195, 1995; Herman, et al., Cancer Res., 55:4525, 
1995). 

Human cancer cells typically contain somatically altered nucleic acid, 

character-ized by mutation, amplification, or deletion of critical genes. In addition, 
20 the nucleic acid from human cancer cells often displays somatic changes in DNA 

methylation (E.R. Fearon, et al., Cell, 61:759. 1990; P.A. Jones, et al., Cancer Res., 

46:461, 1986; R. Holliday, Science, 238:163, 1987; A. De Bustros, et al., Proc. Natl. 

Acad. Sci., USA, £5:5693, 1988); P.A. Jones, et al., Adv. Cancer Res., 54: 1, 1990; 

S.B. Baylin, et al., Cancer Cells,_3:383, 1991; M. Makos, et al., Proc. Natl. Acad. Sci., 
25 USA, 82:1929, 1992; N. Ohtani-Fujita, et al., Onco-gene, 8:1063, 1993). However, 

the precise role of abnormal DNA methylation in human tumorigenesis has not been 

established. 

Aberrant methylation of normally unmethylated CpG islands has been 
described as a frequent event in immortalized and transformed cells, and has been 
30 associated with transcriptional inactivation of defined tumor suppressor genes in 



WO 01/019845 



PCT/US00/25479 



human cancers. In the development of colorectal cancers (CRC), a series of tumor 
suppressor genes (TSG) such as APC, p53, DCC and DPC4 are inactivated by 
mutations and chromosomal deletions (reviewed in Kinzler and Vogelstein 1996). 
Some of these alterations result from a chromosomal instability phenotype described 
5 in a subset of CRC (Lengauer et al., 1997a). Recently, an additional pathway has been 
shown to be involved in a familial form of CRC, hereditary non-polyposis colorectal 
cancer. The cancers from these patients show a characteristic mutator phenotype 
which causes microsatellite instability (MI), and mutations at other gene loci such as 
TGF-beta-RII (Markowitz et al., 1995) and BAX (Rampino et al., 1997). This 
10 phenotype usually results from mutations in the mismatch repair (MMR) genes 

hMSH2 and hMLHl (reviewed by Peltomaki, and de la Chapelle, 1997). A subset of 
sporadic CRC also show MI, but mutations in MMR genes appear to be less frequent 
in these tumors (Liu et ah, 1995; Moslem et al., 1996). 

15 Another molecular defect described in CRC is CpG island (CGI) methylation. 

CGIs are short sequences rich in the CpG dinucleotide and can be found in the 5' 
region of about half of all human genes. Methylation of cytosine within 5 1 CGIs is 
associated with loss of gene expression and has been seen in physiological conditions 
such as X chromosome inactivation and genomic imprinting (reviewed in Latham, 

20 1996). Aberrant methylation of CGIs has been detected in genetic diseases such as the 
fragile-X syndrome, in aging cells and in neoplasia. About half of the tumor 
suppressor genes which have been shown to be mutated in the germline of patients 
with familial cancer syndromes have also been shown to be aberrantly methylated in 
some proportion of sporadic cancers, including Rb, VHL, pl6, hMLHl, and BRCA1 

25 (reviewed in Baylin, S.B., Herman, J.G., Graff^J.R., Vertino, P.M. and Issa, J.P. 

Alterations in DNA methylation: a fundamental aspect of neoplasia, Adv. Cancer Res. 
22:141-196 1998). TSG methylation in cancer is usually associated with (1) lack of 
gene transcription and (2) absence of coding region mutation. Thus it has been 
proposed that CGI methylation serves as an alternative mechanism of gene 

30 inactivation in cancer. 
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The causes and global patterns of CGI methylation in human cancers remain 
poorly defined. Aging could play a factor in this process because methylation of 
several CGIs could be detected in an age-related manner in normal colon mucosa as 
well as in CRC (Issa, J.P., Vertino. P.M., Boehm, CD., Newsham, I.F. and Baylin, 
5 S.B. Methylation of the oestrogen receptor CpG island links ageing and neoplasia in 
human colon. Nat Genet. 2:536-540, 1994). In addition, aberrant methylation of CGIs 
has been associated with the MI phenotype in CRC as well as specific carcinogen 
exposures (Issa et al., 1996) supra. However, an understanding of aberrant 
methylation in CRC has been somewhat limited by the small number of CGIs 

10 analyzed to date. In fact, previous studies have suggested that large numbers of CGIs 
are methylated in immortalized cell lines (Antequera, F., Boyes, J. and Bird, A. High 
levels of de novo methylation and altered chromatin structure at CpG islands in cell 
lines. Cell 62:503-514, 1990), and it is not well understood whether this global 
aberrant methylation is caused by the cell culture conditions or whether they are an 

15 integral part of the pathogenesis of cancer. 

Most of the methods developed to date for detection of methylated cytosine 
depend upon cleavage of the phosphodiester bond alongside cytosine residues, using 
either methylation-sensitive restriction enzymes or reactive chemicals such as 

20 hydrazine which differentiate between cytosine and its 5-methyl derivative. Genomic 
sequencing protocols which identify a 5-MeC residue in genomic DNA as a site that is 
not cleaved by any of the Maxam Gilbert sequencing reactions have also been used, 
but still suffer disadvantages such as the requirement for large amount of genomic 
DNA and the difficulty in detecting a gap in a sequencing ladder which may contain 

25 bands of varying intensity. 

Mapping of methylated regions in DNA has relied primarily on Southern 
hybridization approaches, based on the inability of methylation-sensitive restriction 
enzymes to cleave sequences which contain one or more methylated CpG sites. This 
30 method provides an assessment of the overall methylation status of CpG islands, 
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including some quantitative analysis, but is relatively insensitive and requires large 
amounts of high molecular weight DNA. 

Another method utilizes bisulfite treatment of DNA to convert all 
5 unmethylated cytosines to uracil. The altered DNA is amplified and sequenced to 
show the methylation status of all CpG sites. However, this method is technically 
difficult, labor intensive and without cloning amplified products, it is less sensitive 
than Southern analysis, requiring approximately 10% of the alleles to be methylated 
for detection. 

10 

Identification of the earliest genetic changes in tumorigenesis is a major focus 
in molecular cancer research. Diagnostic approaches based on identification of these 
changes are likely to allow implementation of early detection strategies and novel 
therapeutic approaches targeting these early changes might lead to more effective 
1 5 cancer treatment. 

SUMMARY OF THE INVENTION 

The present invention is based on the finding that several genes are newly 
20 identified as being differentially methylated in cancer. This seminal discovery is 
useful for cancer screening, risk-assessment, prognosis, minimal-residual disease 
identification, staging and identification of therapeutic targets. The identification of 
new genes that are methylated in cancer, aging or diseases associated with aging 
increases the likelihood of finding genes methylated in a particular cancer; increases 
25 the sensitivity and specificity of methylation detection; allows methylation profiling 
using multiple genes; and allows identification of new targets for therapeutic 
intervention. The invention also provides a newly identified gene that is a target for 
hypermethylation in human tumors. This new gene, as well as genes newly identified 
as hypermethylated in cancer and aging or aging diseases provides markers which can 
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be used diagnostically, prognostically and therapeutically over the course of such 
disorders. 

In a first embodiment, the invention provides a nucleic acid molecule 
5 comprising a coding region for a T-type calcium channel, CACNA1G, and regulatory 
sequences associated therewith. The discovery of CpG islands, and in particular, 
methylated CpG islands in the region approximately 300-800 base pairs upstream 
from the CACNA1G translation initiation start site, led to a method of the present 
invention for correlating methylated CpG islands with various cancers. In one aspect 
10 of this embodiment, the nucleic acid molecule encoding CACNA1G and the 
associated regulatory sequences and CpG-rich regions include the nucleic acid 
sequence set forth in SEQ ID NO:51 (Figure 3A). Also provided is a polypeptide 
having an amino acid sequence as set forth in SEQ ID NO: 52 and Figure 3B. The 
methylation state of CpG islands in CACNA1G, associated regulatory regions, and 
15 other genes is indicative of the presence of a cellular proliferative disorder in a subject 
from which the CpG-containing nucleic acid is isolated. 

In another embodiment, there are provided methods for detecting a cellular 
proliferative disorder in a subject. The subject may have or be at risk of having a 

20 cellular proliferative disorder. The method of the invention is useful for diagnostic as 
well as prognostic analyses. One method for detecting a cellular proliferative disorder 
in a subject includes contacting a nucleic acid-containing specimen from the subject 
with an agent that provides a determination of the methylation state of at least one 
gene or associated regulatory region of the gene; and identifying aberrant methylation 

25 of regions of the gene or regulatory region, wherein aberrant methylation is identified 
as being different when compared to the same regions of the gene or associated 
regulatory region in a subject not having the cellular proliferative, thereby detecting a 
cellular proliferative disorder in the subject. The method includes multiplexing by 
utilizing a combination of primers for more than one loci, thereby providing a 

30 methylation "profile" for more than one gene or regulatory region. 
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For the first time, the invention provides methylated forms of the following 
genes and/or their associated regulatory sequences: APOB, CACNA1G, CDX2, 
EGFR, FBN1, GPR37, HSPA6, IQGAP2 3 KL, PAR2, PITX2, PTCH, SDC1, and 
SDC4 (see Table 5). In addition, the invention provides the CpG-rich regions from 
5 these genes that are hypermethylated (see Figures 4A-4F (SEQ ID NO:105-1 19). 

Invention methods include determining, in a nucleic acid-containing specimen 
taken from a subject, the methylation state of a gene or regulatory sequences 
associated therewith, wherein the expression or non-expression of the gene is 

10 associated with the presence of the cellular proliferative disorder, and identifying as 
having a cellular proliferative disorder a subject that has aberrant methylation of 
regions of the gene or associated regulatory sequences when compared to the same 
regions of the gene in a subject not having the cellular proliferative disorder. In one 
aspect of this embodiment, the methylated regions of the gene and associated 

15 regulatory sequences are contained within CpG islands (i.e., CpG rich regions). In 
particular, the aberrant methylation typically includes hypermethylation as compared 
with the same regions of the gene or regulatory sequences in a subject not having the 
cellular proliferative disorder. 

20 Determining the methylation state of the gene includes contacting the nucleic 

acid-containing specimen with an agent that modifies unmethylated cytosine, 
amplifying a CpG-containing nucleic acid in the specimen by means of CpG-specific 
oligonucleotide primers, wherein the oligonucleotide primers distinguish between 
modified methylated and nonmethylated nucleic acid, and detecting the methylated 

25 nucleic acid based on the presence or absence of amplification products produced in 
said amplifying step. The method includes optionally contacting the amplification 
products with a methylation sensitive restriction endonuclease. Other methods for 
determining methylation status of a gene and/or regulatory sequences are well known 
in the art and are described more fully herein. 
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In another embodiment, the present invention provides a method of treating a 
cell proliferative disorder associated with CACNA1G or other methylated genes 
described herein, comprising administering to a subject with the disorder, a 
therapeutically effective amount of reagent which modulates CACNA1G or other 
5 methylated genes' expression. For example, since CACNA1G -associated disorders 
typically involve hypermethylation of CACNA1G polynucleotide sequence, a 
polynucleotide sequence which contains a non-methylatable nucleotide analog is 
utilized for treatment of a subject. Further, the invention provides a method of gene 
therapy comprising introducing into cells of a host subject, an expression vector 
10 comprising a nucleotide sequence encoding CACNA1G or other methylated genes 
described herein, in operable linkage with a promoter. 

In another embodiment of the present invention there is provided a kit useful 
for the detection of a cellular proliferative disorder in a subject having or at risk for 

15 having a cellular proliferative disorder. Invention kits include a carrier means 

compartmentalized to receive a sample, one or more containers comprising a first 
container containing a reagent which modifies unmethylated cytosine and a second 
container containing primers for amplification of a CpG-containing nucleic acid, 
wherein the primers distinguish between modified methylated and nonmethylated 

20 nucleic acid, and optionally, a third container containing a methylation sensitive 
restriction endonuclease. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A shows the relative positions of MINT31 and CACNA1G and 
associated CpG regions. 
5 Figure IB provides a magnified depiction of MINT31, CACNA1G and CpG 

rich regions 1-8. 

Figure 2 depicts the correlation of methylation and the expression of 
CACNA1G. 

Figures 3 A and 3B show the nucleic acid sequence and deduced amino acid 
10 sequence of CACNA1G (SEQ ID NO:51 and 52, respectively). 

Figures 4A-4F show the CpG-rich regions of the genes depicted in Table 5 
(SEQ ID NO: 105-1 19). 

Figure 5 is the nucleotide sequence of MINT31. (SEQ ID NO:120). 

15 DETAILED DESCRIPTION OF THE INVENTION 

It has been determined that an aberrant methylation state of nucleic acids in 
certain genes, particularly regulatory sequences, is diagnostic for the presence or 
potential development of a cellular proliferative disorder in subjects bearing the 

20 aberrantly methylated nucleic acids. More particularly, the hypermethylation of 

certain nucleotides localized in CpG islands has been shown to affect the expression 
of genes associated with the CpG islands; typically such hypermethylated genes have 
reduced or abolished expression, primarily due to down-regulated transcription. 
Using a recently developed PCR-based technique called methylated CpG island 

25 amplification (MCA), several nucleic acid molecules aberrantly methylated in a colon 
cancer cell line were identified. One DNA fragment, termed MINTS 1, mapped to 
human chromosome 17q21 where frequent loss of heterozygosity (LOH) has been 
detected in various human tumors. By characterizing the genomic sequence around 
this area, a gene encoding a T-type calcium channel, CACNA1G, was identified as a 

30 target for hypermethylation in human tumors. Using RT-PCR, expression of 
CACNA1G was detected in normal colon and bone marrow, but expression was 
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absent in 5 tumor cell lines where methylation was found. After treatment with the 
methylation inhibitor 5-deoxy~azacytidine, the expression of CACNA1G was restored 
in all 5 cell lines. Detailed methylation mapping of the 5'CpG island by bisulfite- 
PCR revealed that methylation of a region 300 to 800 base pairs upstream of the 
5 translation initiation site closely correlated with the inactivation of CACNA1G. 
Aberrant methylation of CACNA1G was also examined in various human primary 
tumors, and was detected in 17 of 49 (35%) colorectal cancers, 4 of 16 (25%) gastric 
cancers, and 3 of 23 (13%) acute myelogenous leukemia cases. While not wanting to 
be bound by a particular theory, it is believed that inactivation of CACNA1G may 
10 play a role in cancer development by modulating calcium signaling, which potentially 
affects cell proliferation and apoptosis. 

Thus, in one embodiment of the present invention, there are provided nucleic 
acids comprising the coding region for a T-type calcium channel and regulatory 
15 sequences associated therewith. Specifically, the T-type calcium channel and 
associated regulatory sequences comprise CACNA1G. In a more preferred 
embodiment, the CACNA1G is the human form of the gene. An exemplary 
CACNA1G gene and associated regulatory sequences is set forth in SEQ ID NO:51. 

20 The invention provides methylated and unmethylated nucleic acid encoding 

CACNA1G (SEQ ID NO:5 1). Polynucleotides include DNA, cDNA and RNA 
sequences which encode CACNA1G polypeptide (SEQ ID NO:52). It is understood 
that naturally occuring, synthetic, and intentionally manipulated polynucleotides are 
included. For example, CACNA1G nucleic acid may be subject to site-directed 

25 mutagenesis, or the like. The nucleic acid sequences for CACNA1G also include 
antisense sequences, and sequences encoding dominant negative forms of 
CACNA1G, as well as sequences encoding functional fragments thereof. It is 
understood that naturally occurring, synthetic, and intentionally manipulated 
polynucleotides are included. 
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Methylated nucleic acid sequences are also provided. For the first time, the 
present invention provides methylated chemical structures for the following genes: 
APOB, CACNA1G, CDX2, EGFR, FBN1, GPR37, HSPA6, IQGAP2, KL, PAR2, 
PITX2, PTCH, SDC1, and SDC4. One of skill in the art can now readily locate the 
5 CpG-rich sequences associated with these genes and identify such methylated forms 
of the genes/regulatory sequences by methods described herein (The gene sequences 
can be identified in a gene database found at 

http://www.ncbi.nim.nih.gov/UniGene/index.html). The invention provides CpG-rich 
regions from the above genes as set forth in SEQ ID Nos: 105-1 19. Thus, in yet 

10 another embodiment, the invention provides an isolated nucleic acid molecule having 
at least one methylated Cytosine of a CpG dinucleotide in a CpG-rich region and 
encoding a gene selected from APOB, CACNA1G, CDX2 5 EGFR, FBN1, GPR37, 
HSPA6, IQGAP2, KL, PAR2, PITX2, PTCH, SDC1, and SDC4. The methylated C 
residue of a CpG dinucleotide is located within a CpG-rich region selected from SEQ 

15 ID NO: 105-1 18 and SEQ ID NO: 119. 

The polynucleotides of the invention include "degenerate variants" which are 
sequences that encode the same polypeptide yet vary in sequence as a result of the 
genetic code. There are 20 natural amino acids, most of which are specified by more 
20 than one codon. Therefore, all degenerate nucleotide sequences are included in the 
invention as long as the amino acid sequence of a polypeptide encoded by the 
nucleotide sequence of SEQ ID NO: 51 is functionally unchanged. 

The term "polynucleotide 5 ^ "nucleic acid sequence" refers to a polymeric 
25 form of nucleotides at least 10 bases in length. An "isolated polynucleotide" is a 

polynucleotide that is not immediately contiguous with both of the coding sequences 
with which it is immediately contiguous (one on the 5' end and one on the 3 f end) in 
the naturally occurring genome of the organism from which it is derived. Thus, an 
isolated polynucleotide may include a coding region with its associated regulatory 
30 sequences. The term therefore includes, for example, a recombinant DNA which is 
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incorporated into a vector; into an autonomously replicating plasmid or virus; or into 
the genomic DNA of a prokaryote or eukaryote, or which exists as a separate 
molecule (e.g., a cDNA) independent of other sequences. The nucleotides of the 
invention can be ribonucleotides, deoxyribonucleotides, or modified forms of either 
5 nucleotide. Specifically, methylated forms of nucleotides in a polynucleotide 

sequence, such as regions 1-8 of CACNA1G as described herein, are also included. 
The term includes single and double forms of DNA. 

As will be understood by those of skill in the art, when the sequence is KNA, 
10 the deoxynucleotides A, G, C, and T of SEQ ID NO: 5 1, are replaced by 

ribonucleotides A, G, C, and U, respectively. Also included in the invention are 
fragments of the above-described nucleic acid sequences that are at least 15 bases in 
length, which is sufficient to permit the fragment to selectively hybridize to DNA that 
encodes SEQ ID NO:52. The term "selectively hybridize" refers to hybridization 
15 under moderately or highly stringent conditions (See, Maniatis, as cited herein) which 
excludes non-related nucleotide sequences. 

The CACNA1G nucleic acid sequence includes the disclosed sequence and 
sequences that encode conservative variations of the polypeptides encoded by 

20 CACNA1G polynucleotide provided herein. The term "conservative variation" as 

used herein denotes the replacement of an amino acid residue by another, biologically 
similar residue. Examples of conservative variations include the substitution of one 
hydrophobic residue such as isoleucine, valine, leucine or methionine for another, or 
the substitution of one polar residue for another, such as the substitution of arginine 

25 for lysine, glutamic for aspartic acid, or glutamine for asparagine, and the like. The 
term "conservative variation" also includes the use of a substituted amino acid in 
place of an unsubstituted parent amino acid provided that antibodies raised to the 
substituted polypeptide also immunoreact with the unsubstituted polypeptide. 

30 



WO 01/019845 



PCT/US00/25479 



CACNA1G nucleic acid sequences can be expressed in vitro by DNA transfer 
into a suitable host cell. "Host cells" are cells in which a vector can be propagated 
and its DNA expressed. The cell may be prokaryotic or eukaryotic. The term also 
includes any progeny of the subject host cell. It is understood that all progeny may 
5 not be identical to the parental cell since there may be mutations that occur during 
replication. However, such progeny are included when the term "host cells" is used. 
Methods of stable transfer, meaning that the foreign DNA is continuous-ly maintained 
in the host, are known in the art. 

10 In one aspect, the CACNA1G nucleic acid sequences may be inserted into an 

expression vector. The term "expression vector" refers to a plasmid, virus or other 
vehicle known in the art that has been manipulated by insertion or incorporation of the 
sequence of interest genetic sequences. Polynucleotide sequence which encode 
sequence of interest can be operatively linked to expression control sequences. 

15 "Operatively linked" refers to a juxtaposition wherein the components so described 
are in a relationship permitting them to function in their intended manner. An 
expression control sequence operatively linked to a coding sequence is ligated such 
that expression of the coding sequence is achieved under conditions compatible with 
the regulatory or expression control sequences. As used herein, the terms "regulatory 

20 sequences" and "expression control sequences" refers to nucleic acid sequences that 
regulate the expression of a nucleic acid sequence to which it is operatively linked. 
Expression control sequences are operatively linked to a nucleic acid sequence when 
the expression control sequences control and regulate the transcription and, as 
appropriate, translation of the nucleic acid sequence. Thus expression control 

25 sequences can include appropriate promoters, enhancers, transcription terminators, a 
start codon (i.e., ATG) in front of a protein-encoding gene, splicing signal for introns, 
maintenance of the correct reading frame of that gene to permit proper translation of 
mRNA, and stop codons. The terms "regulatory sequences" and "expression control 
sequences" are intended to included, at a minimum, components whose presence can 

30 influence expression, and can also include additional components whose presence is 
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advantageous, for example, leader sequences and fusion partner sequences. An 
example of an expression control sequence includes a promoter. 

A "promoter" is a minimal sequence sufficient to direct transcription. Also 
5 included in the invention are those promoter elements which are sufficient to render 
promoter-dependent gene expression controllable for cell-type specific, tissue- 
specific, or inducible by external signals or agents; such elements may be located in 
the 5' or 3 f regions of the gene. Both constitutive and inducible promoters, are 
included in the invention (see, e.g., Bitter et al., Methods in Enzymology 153 :516- 

10 544, 1987). For example, when cloning in bacterial systems, inducible promoters 
such as pL of bacteriophage plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like 
may be used. When cloning in mammalian cell systems, promoters derived from the 
genome of mammalian cells (e.g., metallothionein promoter) or from mammalian 
viruses (e.g., the retrovirus long terminal repeat; the adenovirus late promoter; the 

15 vaccinia virus 7.5K promoter) may be used. Promoters produced by recombinant 
DNA or synthetic techniques may also be used to provide for transcription of the 
nucleic acid sequences of the invention. 

In the present invention, the CACNA1G polynucleotide sequence may be 
20 inserted into an expression vector which contains a promoter sequence which 

facilitates the efficient transcription of the inserted genetic sequence of the host. The 
expression vector typically contains an origin of replication, a promoter, as well as 
specific genes which allow phenotypic selection of the transformed cells. Vectors 
suitable for use in the present invention include, but are not limited to the T7-based 
25 expression vector for expression in bacteria (Rosenberg et al., Gene 56: 125, 1987), the 
pMSXND expression vector for expression in mammalian cells (Lee and Nathans, J. 
Biol. Chem. 263:3521, 1988) and baculovirus-derived vectors for expression in insect 
cells. The DNA segment can be present in the vector operably linked to regulatory 
elements, for example, a promoter (e.g., T7, metallothionein I, or polyhedron 
30 promoters). 
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CACNA1G polynucleotide sequences can be expressed in either prokaryotes 
or eukaryotes. Hosts can include microbial, yeast, insect and mammalian organisms. 
Methods of expressing DNA sequences having eukaryotic or viral sequences in 
prokaryotes are well known in the art. Biologically functional viral and plasmid DNA 
5 vectors capable of expression and replication in a host are known in the art. Such 
vectors are used to incorp-orate DNA sequences of the invention. 

"Transformation" means a genetic change induced in a cell following 
incorporation of new DNA (i.e., DNA exogenous to the cell). Where the cell is a 
10 mammalian cell, the genetic change is generally achieved by introduction of the DNA 
into the genome of the cell (i.e., stable). 

Thus, a "transformed cell" is a cell into which (or into an ancestor of which) 
has been introduced, by means of recombinant DNA techniques, a DNA molecule 

15 encoding sequence of interest. Transformation of a host cell with recombinant DNA 
may be carried out by conventional techniques as are well known to those skilled in 
the art. Where the host is prokaryotic, such as E. coli, competent cells which are 
capable of DNA uptake can be prepared from cells harvested after exponential growth 
phase and subsequently treated by the CaC12 method using procedures well known in 

20 the art. Alternatively, MgC12 or RbCl can be used. Transformation can also be 
performed after forming a protoplast of the host cell if desired. 

When the host is a eukaryote, such methods of transfection of DNA as calcium 
phosphate co-precipitates, conventional mechan-ical procedures such as 

25 microinjection, electroporation, insertion of a plasmid encased in liposomes, or virus 
vectors may be used. Eukaryotic cells can also be cotransformed with DNA sequenc- 
es encoding the sequence of interest, and a second foreign DNA molecule encoding a 
selectable phenotype, such as the herpes simplex thymidine kinase gene. Another 
method is to use a eukaryotic viral vector, such as simian virus 40 (SV40) or bovine 

30 papilloma virus, to transiently infect or transform eukary otic cells and express the 
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protein (see for example, Eukaryotic Viral Vectors . Cold Spring Harbor Laboratory, 
Gluzmaned., 1982). 

Methods which are well known to those skilled in the art can be used to 
5 construct expression vectors containing the CACNA1G coding sequence and 
appropriate transcriptional/translational control signals. These methods include in 
vitro recombinant DNA techniques, synthetic techniques, and in vivo 
recombination/genetic techniques. See, for example, the techniques described in 
Maniatis, et al., 1989 Molecular Cloning A Laboratory Manual, Cold Spring Harbor 
10 Laboratory, N.Y. 

A variety of host-expression vector systems may be utilized to express the 
CACNA1G coding sequence. These include but are not limited to microorganisms 
such as bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or 

15 cosmid DNA expression vectors containing the CACNA1G coding sequence; yeast 
transformed with recombinant yeast expression vectors containing the CACNA1G 
coding sequence; plant cell systems infected with recombinant virus expression 
vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or 
transformed with recombinant plasmid expression vectors (e.g., Ti plasmid) 

20 containing the CACNA1G coding sequence; insect cell systems infected with 

recombinant virus expression vectors (e.g., baculo virus) containing the CACNA1G 
coding sequence; or animal cell systems infected with recombinant virus expression 
vectors (e.g., retroviruses, adenovirus, vaccinia virus) containing the CACNA1G 
coding sequence, or transformed animal cell systems engineered for stable expression. 

25 Since CACNA1G has not been confirmed to contain carbohydrates, both bacterial 
expression systems as well as those that provide for translational and post- 
translational modifications may be used; e.g., mammalian, insect, yeast or plant 
expression systems. 

30 Depending on the host/vector system utilized, any of a number of suitable 

transcription and translation elements, including constitutive and inducible promoters, 
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transcription enhancer elements, transcription terminators, etc. may be used in the 
expression vector (see e.g., Bitter, et al., Methods in Enzymology 153:516-544, 1987). 
For example, when cloning in bacterial systems, inducible promoters such as pL of 
bacteriophage .gamma., plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like may be 
5 used. When cloning in mammalian cell systems, promoters derived from the genome 
of mammalian cells (e.g., metallothionein promoter) or from mammalian viruses(e.g., 
the retrovirus long terminal repeat; the adenovirus late promoter; the vaccinia virus 
7.5K promoter) may be used. Promoters produced by recombinant DNA or synthetic 
techniques may also be used to provide for transcription of the inserted CACNA1G 
10 coding sequence. In addition, the endogenous CACNA1G promoter may also be used 
to provide transcription machinery of CACNA1G . 

In bacterial systems a number of expression vectors may be advantageously 
selected depending upon the use intended for the expressed. For example, when large 

15 quantities of CACNA1G are to be produced, vectors which direct the expression of 
high levels of fusion protein products that are readily purified may be desirable. Those 
which are engineered to contain a cleavage site to aid in recovering are preferred. 
Such vectors include but are not limited to the E. coli expression vector pUR278 
(Ruther, et aL, EMBO J. 2:1791, 1983), in which the CACNA1G coding sequence 

20 may be ligated into the vector in frame with the lac Z coding region so that a hybrid - 
lac Z protein is produced; pIN vectors (Inouye & Inouye, Nucleic Acids Res., 
13:3101-3109, 1985; Van Heeke & Schuster, J. Biol. Chem. 264:5503-5509, 1989); 
glutathione-S-transferase (GST) and the like. 

25 In yeast, a number of vectors containing constitutive or inducible promoters 

may be used. For a review see, Current Protocols in Molecular Biology, Vol. 2, 1988, 
Ed. Ausubel, et al., Green Publish. Assoc. & Wiley Interscience, Ch. 13; Grant, et al., 
1987, Expression and Secretion Vectors for Yeast, in Methods in Enzymology, Eds. 
Wu & Grossman, 31987, Acad. Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, 

30 DNA Cloning, Vol. II, IRL Press, Wash., D.C., Ch. 3; and Bitter, 1987, Heterologous 
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Gene Expression in Yeast, Methods in Enzymology, Eds. Berger & Kimmel, Acad. 
Press, N.Y., Vol. 152, pp. 673-684; and The Molecular Biology of the Yeast 
Saccharomyces, 1982, Eds. Strathern, et al., Cold Spring Harbor Press, Vols. I and II. 
A constitutive yeast promoter such as ADH or LEU2 or an inducible promoter such as 
5 GAL may be used (Cloning in Yeast, Ch. 3, R. Rothstein In: DNA Cloning Vol. 1 1 , A 
Practical Approach, Ed. DM Glover, 1986, IRL Press, Wash., D.C.). Alternatively, 
vectors may be used which promote integration of foreign DNA sequences into the 
yeast chromosome. 

10 In cases where plant expression of the CACNA1G coding sequence may be 

driven by any of a number of promoters. For example, viral promoters such as the 35S 
RNA and 19S RNA promoters of CaMV (Brisson, et al., Nature 310:511-514, 1984), 
or the coat protein promoter to TMV (Takamatsu, et al., EMBO J.6:307-31 1, 1987) 
may be used; alternatively, plant promoters such as the small subunit of RUBISCO 

15 (Coruzzi, et al., EMBO J.3:167M680, 1984; Broglie, et al., Science 224:838-843, 
1984); or heat shock promoters, e.g., soybean hspl7.5-E or hspl7.3-B (Gurley, et al., 
Mol. Cell. Biol. 6:559-565, 1986) may be used. These constructs can be introduced 
into plant cells using Ti plasmids, Ri plasmids, plant virus vectors, direct DNA 
transformation, microinjection, electroporation, etc. For reviews of such techniques 

20 see, for example, Weissbach & Weissbach, 1988, Methods for Plant Molecular 

Biology, Academic Press, N.Y., Section VIII, pp. 421-463; and Grierson & Corey, 
1988, Plant Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9. 

An alternative expression system which could be used to express is an insect 
25 system. In one such system, Autographa californica nuclear polyhedrosis virus 

(AcNPV) is used as a vector to express foreign genes. The virus grows in Spodoptera 
frugiperda cells. The CACNA1G coding sequence may be cloned into non-essential 
regions (for example the polyhedrin gene) of the virus and placed under control of an 
AcNPV promoter (for example the polyhedrin promoter). Successful insertion of the 
30 CACNA1G coding sequence will result in inactivation of the polyhedrin gene and 
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production of non-occluded recombinant virus (i.e., virus lacking the proteinaceous 
coat coded for by the polyhedrin gene). These recombinant viruses are then used to 
infect Spodoptera frugiperda cells in which the inserted gene is expressed, (e.g., see 
Smith, et al., 1983, J. Viol. 46:584; Smith, U.S. Pat No. 4,215,051). 

5 

Eukaryotic systems, and preferably mammalian expression systems, allow for 
proper post-translational modifications of expressed mammalian proteins to occur. 
Eukaryotic cells which possess the cellular machinery for proper processing of the 
primary transcript, glycosylation, phosphorylation, and advantageously, secretion of 
10 the gene product may be used as host cells for the expression of CACNA1G . 

Mammalian cell lines may be preferable. Such host cell lines may include but are not 
limited to CHO, VERO, BHK, HeLa, COS, MDCK, -293, and W138. 

Mammalian cell systems which utilize recombinant viruses or viral elements 

15 to direct expression may be engineered. For example, when using adenovirus 

expression vectors, the CACNA1G coding sequence may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric gene may then be inserted in the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential region of the viral genome 

20 (e.g., region El or E3) will result in a recombinant virus that is viable and capable of 
expressing the protein in infected hosts (e.g., see Logan & Shenk, Proc. Natl. Acad. 
Sci. USA, 81:3655-3659, 1984). Alternatively, the vaccinia virus 7.5K promoter may 
be used (e.g., see, Mackett, et al., 1982, Proc. Natl. Acad. Sci. USA 79:7415-7419; 
Mackett, et al., J. Virol. 49:857-864, 1984; Panicali, et al., Proc. Natl. Acad. Sci. USA 

25 79:4927-493 1, 1982). Of particular interest are vectors based on bovine papilloma 

virus which have the ability to replicate as extrachromosomal elements (Sarver, et aL, 
Mol. Cell. Biol. 1:486, 1981). Shortly after entry of this DNA into mouse cells, the 
plasmid replicates to about 100 to 200 copies per cell. Transcription of the inserted 
cDNA does not require integration of the plasmid into the host's chromosome, thereby 

30 yielding a high level of expression. These vectors can be used for stable expression by 
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including a selectable marker in the plasmid, such as, for example, the neo gene. 
Alternatively, the retroviral genome can be modified for use as a vector capable of 
introducing and directing the expression of the CACNA1G gene in host cells (Cone 
& Mulligan, Proc. Natl. Acad. Sci. USA 81:6349-6353, 1984). High level expression 
5 may also be achieved using inducible promoters, including, but not limited to, the 
metallothionine IIA promoter and heat shock promoters. 

For long-term, high-yield production of recombinant proteins, stable 
expression is preferred. Rather than using expression vectors which contain viral 

10 origins of replication, host cells can be transformed with the CACNA1G cDNA 
controlled by appropriate expression control elements (e.g., promoter, enhancer, 
sequences, transcription terminators, polyadenylation sites, etc.), and a selectable 
marker. The selectable marker in the recombinant plasmid confers resistance to the 
selection and allows cells to stably integrate the plasmid into their chromosomes and 

15 grow to form foci which in turn can be cloned and expanded into cell lines. For 

example, following the introduction of foreign DNA, engineered cells maybe allowed 
to grow for 1-2 days in an enriched media, and then are switched to a selective media. 
A number of selection systems may be used, including but not limited to the herpes 
simplex virus thymidine kinase (Wigler, et al., Cell, 1 1 :223, 1977), hypoxanthine- 

20 guanine phosphoribosyltransferase (Szybalska & Szybalski, Proc. Natl. Acad. Sci. 
USA, 48:2026, 1962), and adenine phosphoribosyltransferase (Lowy, et al., Cell, 22: 
817, 1980) genese can be employed in tk.sup.-, hgprt.sup.- or aprt.sup.- cells 
respectively. Also, antimetabolite resistance can be used as the basis of selection for 
dhfr, which confers resistance to methotrexate (Wigler, et al., Natl. Acad. Sci. USA, 

25 77:3567, 1980; O'Hare, et ah, Proc. Natl. Acad. Sci. USA, 78: 1527, 1981); gpt, which 
confers resistance to mycophenolic acid (Mulligan & Berg, Proc. Natl. Acad. Sci. 
USA, 78: 2072, 1981; neo which confers resistance to the aminoglycoside G-418 
(Colberre-Garapin, et aL, J. Mol. Biol., 150:1, 1981); andhygro, which confers 
resistance to hygromycin (Santerre, et al., Gene, 30:147, 1984) genes. Recently, 

30 additional selectable genes have been described, namely trpB, which allows cells to 
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utilize indole in place of tryptophan; hisD, which allows cells to utilize histinol in 
place of histidine (Hartman & Mulligan, Proc. Natl. Acad. Sci. USA, 85:8047, 1988); 
and ODC (ornithine decarboxylase) which confers resistance to the ornithine 
decarboxylase inhibitor, 2-(difluoromethyl)-DL-ornithine, DFMO (McConlogue L., 
5 1987, In: Current Communications in Molecular Biology, Cold Spring Harbor 
Laboratory, ed.). 



Isolation and purification of microbial expressed polypeptide, or fragments 
thereof, provided by the invention, may be carried out by conventional means 
10 including preparative chromatography and immunological separations involving 
monoclonal or polyclonal antibodies. 



In one embodiment, the invention provides substantially purified polypeptide 
encoded by CACNA1G polynucleotide sequences. Exemplary CACNA1G 

15 polypeptide is set forth in SEQ ID NO:52. The term "substantially purified" as used 
herein refers to a polypeptide which is substantially free of other proteins, lipids, 
carbohydrates or other materials with which it is naturally associated. One skilled in 
the art can purify a polypeptide encoded by CACNA1G polynucleotide sequence 
using standard techniques for protein purification. The substantially pure polypeptide 

20 will yield a single major band on a non-reducing polyacrylamide gel. The purity of the 
CACNA1G polypeptide can also be determined by amino-terminal amino acid 
sequence analysis. 



Minor modifications of the CACNA1G primary amino acid sequences may 
25 result in proteins which have substantially equivalent activity as compared to the 
unmodified counterpart polypeptide described herein. Such modifications may be 
deliberate, as by site-directed mutagenesis, or may be spontaneous. All of the 
polypeptides produced by these modifications are included herein as long as the 
biological activity still ex-ists. 



30 



WO 01/019845 



PCT/US00/25479 



22 

The polypeptides of the invention also include dominant negative forms of the 
CACNA1G polypeptide which do not have the biological activity of CACNA1G 
polypeptide sequence. A "dominant negative form" of CACNA1G is a polypeptide 
that is structurally similar to CACNA1G polypeptide but does not have wild-type 
5 CACNA1G function. For example, a dominant-negative CACNA1G polypeptide 
may interfere with wild-type CACNA1G function by binding to, or otherwise 
sequestering, regulating agents, such as upstream or downstream components, that 
normally interact functionally with the CACNA1G polypeptide. 

10 Identification and Isolation of CACNA1G 

To identify genes differentially methylated in colorectal cancer, methylated 
CpG island amplification was used followed by representational difference analysis 
(Razin and Cedar, Cell 17: 473-476, 1994, herein incorporated by reference). One of 

15 the clones recovered (MINT31, see U.S. Patent Application Serial No. 09/309,175, 
incorporated by reference herein in its entirety) mapped to human chromosome 17q21 
using a radiation hybrid panel. A Blast search revealed this fragment to be completely 
identical to part of a BAC clone (Genbank: AC004590) sequenced by high throughput 
genomic sequencing. The region surrounding MINT31 fulfills the criteria of a CpG 

20 island: GC content 0.67, CpG/GpC ratio 0.78 and a total of 305 CpG sites in a 4 kb 
region. Using this CpG island and 10 kb of flanking sequences in a Blast analysis, 
several regions highly homologous to the rat T-type calcium channel gene, 
CACNA1G, were identified (Perez-Reyes et al., Nature 3£1: 896-900. 1998, herein 
incorporated by reference). Several ESTs were also identified in this region. Using 

25 Genscan, 2 putative coding sequences (Gl, and G2) were identified. Blastp analysis 
revealed that Gl has a high homology to the EH-domain-binding protein, epsin, while 
G2 is homologous to a C. elegans hypothetical protein (accession No. 2496828). 

The MINT31 CpG island corresponds to the 3 1 regions of Gl and G2, based on 
30 the direction of the open reading frame and the presence of a poly A tail, and, without 
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being bound by theory, is unlikely to influence their transcription. The EST closest to 
MINTS 1 (H13333) (SEQ ID NO: 120; Figure 5) was sequenced entirely and was 
found not to contain a continuous open reading frame, but a poly-adenylation signal 
was identified on one end, along with a poly A tail. These data suggest that HI 3333 
5 corresponds to the last 2 exons of an unidentified gene. MINT3 1 is in the intron of 
this gene and is, again without being bound by theory, unlikely to influence 
transcription of the unidentified gene. However, based on both promoter prediction 
(TSSG) analysis of this region and homology to the rat CACNA1G sequence, the 
MINT31 CpG island is also in the 5 f region of human CACNA1G gene and is likely 
10 to play a role in its transcriptional activity. 

The human CACNA1G sequence deposited in Genbank lacks the 5 ! region of 
the gene, when compared to the rat homologue. To determine the 5' region of human 
CACNA1G, cDNA was amplified by RT-PCR using primers based on the BAG 

15 sequence (Genbank: AC004590, herein incorporated by reference). The PGR products 
were cloned and sequenced, and the genomic organization of the gene was determined 
by comparing the newly identified sequences as well as the known sequences to the 
BAG that covers this region. CACNA1G is composed of 34 exons which span a 70 kb 
area. (See, Figure 3 A and SEQ ID NO: 52). Based on sequences deposited in 

20 Genbank, the gene has two possible 3' ends caused by alternate splicing. Human 

CACNA1G is highly homologous to rat CACNA1G with 93% identity at the protein 
level, and 89% identity at the nucleotide level. The 5 ! flanking region of human 
CACNA1G lacks TATA and CAAT boxes, which is similar to many housekeeping 
genes. A putative TFIID binding site was identified 547-556 bp upstream from the 

25 translation start site, and several other potential transcription factor binding sites such 
as API (1 site), AP2 (2 sites) and SP1 (10 sites), were identified upstream of 
CACNA1G exon 1 using the promoter prediction program, TESS. 

30 
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Methylation Analysis of CACNA1G 

The CACNA1G CpG island is 4 kb, and is larger than many typical CpG 
islands. MINTS 1 corresponds to the 5' edge of the island while CACNA1G is in the 3 f 
5 region. It is not known whether large CpG islands such as this are coordinately 
regulated with regards to protection from methylation, and aberrant methylation in 
cancer. 

To address this issue, the methylation status of the 5* region of CACNA1G 

10 was studied using bisulfite-PCR of DNA from normal tissues as well as 35 human 
cancer cell lines from colon, lung, prostate, breast and hematopoietic tumors. More 
specifically, forty-nine primary colorectal cancers, 28 colorectal adenomas, 16 
primary gastric cancers and 17 acute myelogenous leukemia samples were used for 
methylation analyses. DNA from eight colon cancer cell lines (Caco2, RKO, SW48, 

15 HCT1 16, DLD1, Lovo, SW837, HT29), 4 lung cancer cell lines (OH3, H249, H157, 
H209), 4 glioblastoma cell fines (Dauy, D283, U87, U373), 8 breast cancer cell lines 
(MB-468, MCF7, MB-231, MB-474, MB-435, MB-453, BT20, CAMA1, SKBR3), 7 
hematopoietic tumor cell lines (CEM, Raji, KG1 A, HL60, ML-l,MoIt3, K562), and 4 
prostate cancer cell lines (DU145, DUPRO, LNCAP, TSUPRL) were also 

20 investigated. The CpG island was divided into 8 regions (SEQ ID NOs: 35-42, 

respectively). The methylation status of each region was examined separately. The 
genomic DNA was treated with sodium bisulfite and PCR amplified using primers 
containing no or a minimum number of CpG sites. (For a detailed description of 
bisulfite-PCR, see, U.S. Patent No. 5,786,146, incorporated herein by reference in its 

25 entirety). Methylated alleles were detected by digesting the PCR products using 
restriction enzymes which specifically cleave sites created or retained due to the 
presence of methylated CpGs. None of the regions was methylated in normal colon, 
consistent with a uniform protection against de-novo methylation. 
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Regions 1 and 2 were frequently methylated in cancer cell lines, and behaved 
in a concordant manner with respect to methylation pattern. Indeed, these 2 regions 
were methylated in most cancer cell types except gliomas. Moreover, most cell lines 
where methylation was found methylated both regions 1 and 2. in contrast, region 3, 
5 which is less CG rich than any of the other regions, had either no methylation or very 
low levels of methylation in most cell lines. Regions 5, 6, and 7 behaved quite 
differently compared to 1-3. Methylation of these regions was less frequent than 
regions 1-2, as 22/35 cell lines had no detectable methylation there, despite often 
showing methylation of region 1-2. However, when methylation of regions 5, 6, or 7 

10 was present (in 13/35 cell lines), it affected all 3 regions in a coordinate manner, 

although to varying extents. Finally, regions 4 and 8 behaved differently again, being 
partially methylated primarily in colon and breast cell lines. Therefore, with regards to 
hypermethylation in cancer cells, the CpG rich region upstream of CACNA1G 
appears to be composed of 2 CpG islands which behave independently. MINT3 1 

15 corresponds to the upstream CpG island (island 1, regions 1 and 2), while the 5 f region 
of CACNA1G is contained in the downstream CpG island (island 2, regions 5-7). 
Regions 3, 4 and 8 correspond to the edge of these CpG islands, and behave a little 
differently than the hearts of the CpG islands, as previously described for the E-Cad 
gene (Graff, et aL, J. Biol. Chem. 272: 22322-22329, 1997). 

20 

Overall, the methylation patterns of CACNA1G fell into 5 distinct categories: 
(1) No methylation in any region (normal tissue). (2) Slight methylation of island 1 (6 
cell lines, e.g., TSU-PRL). (3) Heavy methylation of island 1 but no methylation of 
island 2 (16 cell lines, e.g., Caco2). (4) Heavy methylation of island 1 and moderate to 
25 heavy methylation of island 2 (6 cell lines, e.g., RKO and Raji). (5) High methylation 
of island 1 and low to moderate methylation of island 2 (7 cell lines, e.g., MB-231). 
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Methylation Dependent Expression of CACNA1G 

In a previous study, rat CACNA1G was shown to be expressed most 
abundantly in the brain (Perez-Reyes et al., Nature 321: 896-900. 1998). To determine 
5 the expression of CACNA1G in normal and neoplastic human cells, RT-PCR was 

performed using cDNA from various normal tissues and from a panel of 27 tumor cell 
lines. CACNA1G was expressed ubiquitously in a variety of tissues and cell lines. In 
normal tissues expression was relatively low but easily detectable, while most cell 
lines had relatively high expression of CACNA1G. However, some cell lines had 

10 negligible or totally absent levels of CACNA1G expression. The results of 
CACNA1G expression was correlated with the detailed methylation analysis 
previously described. In this analysis, a remarkable pattern emerged. Methylation of 
regions 1-4 and 8 had no effect on CACNA1G expression. However, there was a 
strong correlation between methylation of regions 5-7 and expression of the gene. In 

15 fact, all cell lines tested that lack methylation of this region strongly express the gene. 
All 6 cell lines with pattern 4 methylation studied had no detectable expression. 
Finally, the 7 cell lines with pattern 5 methylation (examples DLD-1 and MB-453) 
had variable levels of expression ranging from very low to near normal. The fact that 
patterns 3 and 5 differ significantly with regards to expression, but are almost 

20 identical with regards to methylation of all regions except 7 indicates that this area is 
important in the inactivation of CACNA1G. 

To confirm that methylation of the 5' CpG island of CACNA1G is associated 
with gene inactivation, 3 non-expressing cell lines showing pattern 4 methylation 
25 (RKO, SW48 and Raji) and 2 weakly expressing cell lines showing pattern 5 

methylation (MB-23 1 and MB-435) were treated with 1 M of the methyl-transferase 
inhibitor 5-deoxy-azacitidine. After treatment, all of these cell lines re-expressed 
CACNA1G mRNA. Consistent with re-expression, demethylation of region 7 was 
observed after 5-deoxy-azacitidine treatment. 
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De novo cytosine methylation is thought to sometimes occur in vitro during 
cell propagation (Antequera et al., Cell £2: 503-514, 1990). To determine whether the 
methylation of CACNA1G occurs in vivo, primary human tumors were examined for 
methylation of the 5' region of CACNA1G. Aberrant methylation was detected in 17 
5 out of 49 (35%) colorectal cancers, 4 out of 28 colorectal adenomas (25%), 4 out of 
16 (25%) gastric cancers and 3 out of 17 (18%) acute myelogenous leukemia cases. In 
colorectal cancers, there was a significant correlation between methylation of 
CACNA1G and methylation of pl6 (p<0.005) and hMLHl (pO.OOl), as well as a 
strong correlation with the presence of microsatellite instability, and the recently 
10 identified CpG island methylator phenotype (CIMP), indicating that CACNA1G is 
also a target for CIMP in colorectal cancer. 

To determine whether aberrant methylation of the 5 ! region of CACNA1G 
affects the expression status of this gene in primary tumors, RT-PCR was performed 
15 using cDNA from a series of colorectal adenomas. Six out of 8 cases which showed 
no methylation of region 7 expressed CACNA1G. In sharp contrast, all 5 cases that 
showed methylation of region 7 had no detectable expression of the gene. 

Thus, a human T-type calcium channel gene (CACNAIG) has been identified 
20 and cloned using the MINT3 1 sequence as a probe. The human T-type calcium 

channel gene has been determined to be a target of aberrant methylation and silencing 
in human tumors. The data show that MINT3 1 (for a representative sequence of 
MINT 1-3 3, see, US Patent Application Serial No. 09/309,175) hereby incorporated by 
reference can be used as a probe to identify genes that play a role in disorders such as 
25 cell proliferative disorders. 

Detailed analysis of the CpG island upstream of CACNAIG revealed that 
methylation 300 to 800 bp upstream of the gene closely correlated with transcriptional 
inactivation. The CACNAIG promoter is contained in a large CG rich area that is not 
30 coordinately methylated in cancer. The CpG island around MINT3 1 is much more 
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frequently methylated in cancers compared to that just upstream of CACNA1G. This 
may simply be caused by differential susceptibility to de-novo methylation between 
these two regions, with methylation of MINT31 serving as a trigger, and eventually 
spreading to CACNA1G, as described in other genes (Graff, et al., J. Biol. Chem. 
5 272 : 22322-22329, 1997). However, it is likely that these 2 regions are controlled by 
different mechanisms because (1) cell lines kept in culture for countless generations 
do not in fact spread methylation from MINT31 to CACNA1G (e.g., Caco2), (2) 
region 3 that separates the 2 islands is infrequently and sparsely methylated in cancer 
and (3) 2 cases of primary colorectal cancer were found which are methylated at the 
10 CACNA1G promoter but not at MINT3 1). Therefore, methylation of MINT3 1 

appears to be independent of methylation of CACNA1G suggesting that they are 2 
distinct CpG islands regulated by different mechanisms. 

Many CpG islands of silenced genes appear to be methylated uniformly and 
15 heavily throughout the island (e.g., Graff, et al., J. Biol. Chem. 272: 22322-22329, 

1997). In contrast the methylation patterns of the 5' region of CACNA1G (region 5-7) 
was heterogeneous in the cell lines which did not express this gene. Nevertheless, 
methylation clearly plays a role in CACNA1G repression since demethylation readily 
reactivates the gene. 

20 

The mechanism of CACNA1G methylation remains to be determined. 
Methylation was not detected in normal colon mucosa, placenta, normal breast 
epithelium and normal bone marrow, including samples from aged patients, 
suggesting that methylation of this region is specific for cell proliferative disorders 
25 such as cancer, and the like. However, there was a significant correlation between 
methylation of CACNA1G and other tumor suppressor genes such as pi 6 and 
hMLHL Thus, CACNA1G is likely a target for the recently described CIMP 
phenotype, which results in a form of epigenetic instability with simultaneous 
inactivation of multiple genes. 



30 
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T-type calcium channels are involved not only in electrophysiological rhythm 
generation but also in the control of cytosolic calcium during cell proliferation and 
cell death (reviewed in Berridge, et ah, Nature 395: 645-648, 1998). Expression of 
CACNA1G is not limited to brain and heart, indicating a likely role in other tissues in 

5 which it is expressed. It has previously been shown that Ca2+ influx via T-type 

channels is an important factor during the initial stages of cell death such as apoptosis 
(Berridge, et al., Nature 395: 645-648, 1998), ischemia (Fern, J. Neurosci. 18: 7232- 
7243, 1998) and complement-induced cytotoxicity (Newsholme, et al., Biochem. J. 
295: 773-779, 1993.). The studies culminating in the present invention indicates that 

10 impairment of voltage gated calcium channels plays an important role in cancer 
development and progression through altering calcium signaling. 

Due to the clear correlation between methylation of CpG islands and cellular 
proliferative disorders, in another embodiment of the present invention, there are 

15 provided methods for detecting a cellular proliferative disorder in a subject having or 
at risk for said cellular proliferative disorder. The method includes assaying, in 
nucleic acid-containing specimen taken from said subject, the methylation state of a 
gene or its associated regulatory regions, wherein the expression state of the gene or 
its associated regulatory regions is associated with the presence of the cellular 

20 proliferative disorder, and identifying as having a cellular proliferative disorder a 

subject that has aberrant methylation of regions of said gene. The method provides for 
detecting a cellular proliferative disorder in a subject having or at risk for said cellular 
proliferative disorder by identifying aberrantly methylation of regions of a gene when 
compared to the same regions of the gene in a subject not having said cellular 

25 proliferative disorder. 

The aberrant methylation comprises hypermethylated CpG rich regions (i.e., 
islands). In one aspect of the present invention, the CpG rich regions are associated 
with the CACNA1G gene, and affect the expression thereof in a methylation state- 
30 dependent manner. A "cell proliferative disorder" or "cellular proliferative disorder" 
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is any disorder in which the proliferative capabilities of the affected cells is different 
from the normal proliferative capabilities of unaffected cells. An example of a cell 
proliferative disorder is neoplasia. Malignant cells (i.e., cancer) develop as a result of 
a multistep process. Specific, non-limiting examples of cell proliferative disorders 
5 associated with increased methylation of CpG-islands are low grade astrocytoma, 
anaplastic astrocytoma, glioblastoma, medulloblastoma, gastric cancer, colorectal 
cancer, colorectal adenoma, acute myelogenous leukemia, lung cancer, renal cancer, 
leukemia, breast cancer, prostate cancer, endometrial cancer and neuroblastoma. 

10 A cell proliferative disorder as described herein may be a neoplasm. Such 

neoplasms are either benign or malignant. The term "neoplasm" refers to a new, 
abnormal growth of cells or a growth of abnormal cells that reproduce faster than 
normal. A neoplasm creates an unstructured mass (a tumor) which can be either 
benign or malignant. For example, the neoplasm may be a head, neck, lung, 

15 esophageal, stomach, prostate, small bowel, colon, bladder, kidney, or cervical 

neoplasm. The term "benign" refers to a tumor that is noncancerous, e.g. its cells do 
not proliferate or invade surrounding tissues. The term "malignant" refers to a tumor 
that is metastastic or no longer under normal cellular growth control. 

20 A cell proliferative disorder may be an age-associated disorder. Examples of 

age-associated disorders which are cell proliferative disorders include colon cancer, 
lung cancer, breast cancer, prostate cancer, leukemia and melanoma, amongst others. 

A "nucleic acid containing specimen' 9 includes any type of material containing 
25 a nucleic acid to be subject to invention methods. The nucleic acid may be contained 
in a biological sample. Such samples include but are not limited to any bodily fluid, 
such as a serum, urine, saliva, blood, cerebrospinal fluid, pleural fluid, ascites fluid, 
sputum, stool, or a biopsy sample. 

30 Samples or specimens include any CpG-rich DNA sequence, whatever the 

origin, as long as the sequence is detectably present in a sample. While routine 
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diagnostic tests may not be able to identify cancer cells in these samples, the method 
of the present invention identifies neoplastic cells derived from the primary tumor or 
from a metastases. The method includes non-invasive sampling (e.g., bodily fluid) as 
well as invasive sampling (e.g., biopsy). The sample of DNA of the subject may be 
5 serum, plasma, lymphocytes, urine, sputum, bile, stool, cervical tissue, saliva, tears, 
cerebral spinal fluid, regional lymph node, histopathologic margins, and any bodily 
fluid that drains a body cavity or organ. Therefore, the method provides for the non- 
invasive detection of various tumor types including head and neck cancer, lung 
cancer, esophageal cancer, stomach cancer, small bowel cancer, colon cancer, bladder 

10 cancer, kidney cancers, cervical cancer and any other organ type that has a draining 
fluid accessible to analysis. For example, neoplasia of regional lymph nodes 
associated with a primary mammary tumor can be detected using the method of the 
invention. Regional lymph nodes for head and neck carcinomas include cervical 
lymph nodes, prelaryngeal lymph nodes, pulmonary juxta-esophageal lymph nodes 

15 and submandibular lymph nodes. Regional lymph nodes for mammary tissue 

carcinomas include the axillary and intercostal nodes. Samples also include urine 
DNA for bladder cancer or plasma or saliva DNA for head and neck cancer patients. 

Any nucleic acid sample, in purified or nonpurified form, can be utilized as the 
20 starting nucleic acid or acids in accordance with the present invention, provided it 
contains, or is suspected of containing, a nucleic acid sequence containing a target 
locus (e.g., CpG-containing nucleic acid). In general, the CpG-containing nucleic 
acid is DNA. However, invention methods may employ, for example, samples that 
contain DNA, or DNA and RNA, including messenger RNA, wherein DNA or RNA 
25 may be single stranded or double stranded, or a DNA-RNA hybrid may be included in 
the sample. A mixture of nucleic acids may also be employed. The specific nucleic 
acid sequence to be detected may be a fraction of a larger molecule or can be present 
initially as a discrete molecule, so that the specific sequence constitutes the entire 
nucleic acid. It is not necessary that the sequence to be studied be present initially in a 
30 pure form; the nucleic acid may be a minor fraction of a complex mixture, such as 
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contained in whole human DNA. The nucleic acid-containing sample used for 
detection of methylated CpG may be from any source including, but not limited to, 
brain, colon, urogenital, lung, renal, pancreas, liver, esophagus, stomach, 
hematopoietic, breast, thymus, testis, ovarian, prostate and uterine tissue, and may be 
5 extracted by a variety of techniques such as that described by Maniatis, et al. 

( Molecular Cloning: a L aboratory Manual. Cold Spring Harbor, NY, pp 280, 281, 
1982). 

The nucleic acid of interest can be any nucleic acid where it is desirable to 
10 detect the presence of a differentially methylated CpG island. The CpG island 
comprises a CpG island located in a gene or regulatory region for a gene. A "CpG 
island" is a CpG rich region of a nucleic acid sequence. The nucleic acid sequence 
may include, for example, APOB, CACNA1G, CDX2, EGFR, FBN1, GPR37, 
HSPA6, IQGAP2, KL, PAR2, PITX2, PTCH, SDC1, or SDC4 (see for example 
15 Figures 4A-4F). Alternatively the nucleic acid of interest can be, for example, a 
MINTS 1 nucleic acid sequence (SEQ ID NO: 120. However, any gene or nucleic acid 
sequence of interest containing a CpG sequence can provide diagnostic information 
(i.e., via its mmethylation state) using invention methods. 

20 Moreover, these markers can also be multiplexed in a single amplification 

reaction to generate a low cost, reliable cancer screening test for many cancers 
simultaneously. A combination of DNA markers for CpG-rich regions of nucleic acid 
may be amplified in a single amplification reaction. The markers are multiplexed in a 
single amplification reaction, for example, by combining primers for more than one 

25 locus. For example, DNA from a urine sample can be amplified with three different 
randomly labeled primer sets, such as those used for the amplification of the 
CACNA1G, EGFR and PTCH loci, in the same amplification reaction. The reaction 
products are separated on a denaturing polyacrylamide gel, for example, and then 
exposed to film for visualization and analysis. By analyzing a panel of markers, there 

30 is a greater probability of producing a more useful methylation profile for a subject. 
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If the sample is impure (e.g., plasma, serum, stool, ejaculate, sputum, saliva, 
cerebrospinal fluid, or blood or a sample embedded in paraffin), it may be treated 
before amplification with a reagent effective for lysing the cells contained in the 
fluids, tissues, or animal cell membranes of the sample, and for exposing the nucleic 
5 acid(s) contained therein. Methods for purifying or partially purifying nucleic acid 
from a sample are well known in the art (e.g.,Sambrook et al., Molecular Cloning: a 
Laboratory Manual , Cold Spring Harbor Press, 1989, herein incorporated by 
reference). 

10 In order to detect a differential methylation state for a gene or CpG-containing 

region of interest, invention methods include any means known in the art for detecting 
such differential methylation. For example, detecting the differential methylation may 
include contacting the nucleic acid-containing specimen with an agent that modifies 
unmethylated cytosine, amplifying a CpG-containing nucleic acid in the specimen by 

15 means of CpG-specific oligonucleotide primers, wherein the oligonucleotide primers 
distinguish between modified methylated and nonmethylated nucleic acid, and 
detecting the methylated nucleic acid based on the presence or absence of 
amplification products produced in said amplifying step. This embodiment includes 
the PCR-based methods described in U.S. Patent No. 5,786,146, incorporated herein 

20 in its entirety. 

For the first time, the methylation state of a number of genes has been 
correlated with cell proliferative disorders. Examples of such genes, primers useful 
for identifying their methylation state, and general PGR conditions are set out in Table 
25 1. 
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TABLE 1 Bisulfite-PCR Primers 





Genes 


5 '-primer 3 '-primer 


Annealing Temperature(oC) 




APOB 


5'-gtttttgtagtttgggtttttt-3 5 


5 '-RccaacaacRccaacaac-3 ' 


57 






TaqI 






5 


CACNA1G' 


See Table 2 








R6 


5 '-tteteevetteev eataett-3 1 


5 '-acraaaaaaaaaaaaaaaaatctctt-3 ' 


A TTT4 

47 ID 






Hintl 








R7 


~" &E>&tz)E>J Qf^^^^j 


5'-ttcccctacrcccctaaaacttcc-3 


49 TD 






EcoRI 






10 


CDX2 


5 '-gtagettagagggagggatyett-3 ' 

O ^"OO O OOO OOO •/ o 


5 -aaaacaaacctcaccatactacct-3 


oU 






Taql 








EGFR 


5 '-tttgatttYgtttagtattgat-3 ' 


5 -cccttacctttcttttcct-3 


JZ 






Hinfl, Taql 








FBN1 


5 ' -tttttttattgYgttaattttg-3 ' 


111 ~ i ii, _ ^ . X ~ 1 J 

5 -tttccccacctcttcaaata-i 


54 


15 




Taql 








GPR37 


5 '-esttaggtggggtaagasttt-3 ' 


5 -aacRtttaatccaattacaaacc-3 


56 






Hinfl 








HSPA6 


5'- ttttttagtagttttgagttagagg-3' 


^9 ( l 1 l 1 1 1 1 , < i 

5 - tticatcctutacacctccc-3 


oU 






Taql 






20 


IOGAP2 


5 ' -tttatttttattttagtgttag-3 ' 


3 -ctcttcivtataacatcctac-j 


52 






Taql 








KL 


5 ' - gtagtatgtty gttagy gttt-3 ' 


5'- aaataccctaaaaaaaaccc-3 


60 






Taql 








PAR2 


5 '-YggtttttttGaaatttaattc-3 ' 


5 '-aactccRcatcctcctaaa-3 ' 


45 


25 




Hinfl, Taql 








PITX2 


5 taagtgtttygttaggtttttt-3 ' 


5'- ccaaactccactacacaataac-3 5 


60 






Taql 




60 




PTCHA 


5 s - gttggtttgttaatyggagt-3' 


5 ' - ttaccaaccraaaccatatt-3 ' 






Taql 






30 


PTCHB 


5'- aatgtgtggaatttaggga-3' 


5'- taaaacaaccaactacaacttac-3 ' 


60 






Taql 




60 




SDC1 


5'- agaggaattyggtagtagagag-3' 


5 cacaaccaaaaccaaaac-3 ' 






Taql 




56 




SDC4 


5 '-ggggatttgtttggtagtgg-3 ' 


5 '-cccgaaattccccaataaa-3 ' 


35 




Hinfl 







Note: Y = C or T, R= G or A; row 1-SEQ ID NO:l and 2; row 2-SEQ ID NO:3 and 4 and so forth 
through to SEQ ID NO:32, respectively. The gene sequences can be found in a gene database found at 
http://www.ncbi.nim.nih.gov/UniGene/index.html. 

40 

In one embodiment, the oligonucleotide primers are specifically targeted to 
CACNA1G and its associated CpG islands as described herein. Examples of 
oligonucleotides suited for deterniining the methylation state of the 8 regions of the 
two CpG islands of MINT3 1/CACNA1 G, as well as PGR conditions and useful 
45 methylation sensitive restriction endonucleases are set out in Table 2. 



WO 01/019845 



PCT/US00/25479 



35 



TABLE 2 Primers Useful For Bisulfite/PCR Analysis of CACNA1G 





U p« crt r\n 
XVCglUIl 


Primpf <?et forward /reverse 

X 1 llllv_/l O^/ L j 1U1 VY OX VJ- 1 1 1» V wl Ov 


Annealing 


Restriction 








temr>erature fcvcles^ 


enzyme 






V-GAYGGYGTAGTAGTTATTTTGTT-3' 

J VJ .fx X VJ VJ X VJ X rVVJ X -Ti-VJ J. J. jT>. X X X X VJ X -i. 


58 (3) 56 (4) 


BstUI 






V-fA TP A rr A CCCCTC ACTTTAC-3' 


54 (5), 52 (26) 


Maell 




XVCglUIJL Z, 


5 »_ TTYGGGT A TTT A T A GTTTTTTGGAG-3 1 

J X X X VJ VJ VJ X jTV XXX A X .ZTlVJ X X X X X X \_i vJJXVJ — ' 


60 (3) 58 (4\ 


TaqI 




GA/T9 


V-A ATrTAGPRCCTTCACTCACTC-3' 

J "rVri i v^ x /tlV^V-'Xx.V-/V> x x ^/Av x v^rvv x v* — ' 


56 C5 / 54 (26) 


BstUI 




Region 3 


A nG A GYGTT A A TGTGAGGTT-3 1 

J X X X rxVJ VJ -Ti-VJ X VJ X X AA X VJ X VJxxVJ VJ X X _> 


55 (3 1* 53 (4) 


Hinfl 




vJlVx.? 


V-CT A A A A A A A CCC A ATCTTAAAAAAAC-3' 


51(5), 49(26) 


Maell 




Region 4 


V-TGGATA A AGGATGTTTGGGGTTTG-3' 

«J X VJ VJ .iJ. X rXfX/XvJ VJ -£X A V_J _L _1_ _i_ VJ \J VJ VJ J- J- -L 1 — ' 


55 (5) 53 (5), 


Maell 




GM5 

VJ1YX *J 


5 ! -CCCTCCCCTTACCCCTAAATCC-3' 


51(5), 49(26) 


TaqI 




Region 5 


S'-AATYGGATTTTAGTTGTGGTTTTT^' 


60 (3), 58 (4), 


BstUI 




GM1 


5'-CACACCACAACTAAATCCCTCACT-3' 


56 (5), 54 (26) 


TaqI 


15 


Region 6 


5-TTGTGGYGTTGGYGATAGTT-3 ' 


53 (3), 51 (4), 


Hinfl 




GM6 


5 , -ACl^AAAAAAAAAAAAAAAAATCTCTT-3 , 


49 (5), 47 (26) 






Region 7 


S'-GGGGGYGTTTTTTTTYGGATTTT-S' 


55 (5), 53 (5), 


EcoRI 




GM4 


5 , -TTCCCCTACRCCCCTAAAACTTCC-3 , 


51(5), 49(26) 






Region 8 


5-GGGAGTTTGGGAGTTGTATTTTGTT-3' 


60 (3), 58 (4), 


TaqI 


20 


5'-AACCAAATTAAAAAATCAAACCCTAA-3' 


56 (5), 54 (26) 






Region 8 


5 , -GAGGGGGGATYGTAATTTTTTG-3' 


60 (3), 58 (4), 


BstUI 




5 , -CCRAAATCTCCTTATTATACCTCCAA-3' 


56 (5), 54 (26) 





25 



Region 1 F/R=SEQ ID NO:33 and 34; Region 2=SEQ ID NO:35 and 36 and so forth through 
Region 8 (TaqI)=SEQ ID NO:47 and 48 and Region 8 (BstUI)=SEQ ID NO:49 and 50. 
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Exemplary target regions (i.e., regions 1-8 of MINT31/CACNA1G) that are 
complementary to the primers listed in Table 2 are provided in Table 3: 

Table 3 - Target Sequences 



Region/ 
Primer 


Target set, forward /reverse 


SEQ ID NO 


Corresponding Primer 
SEQ ID NO 


Region 1 


5'- AACAAAATAACTACTACRCCRTC-3' 


87 


33 


F165 


5'- GTAAAGTGAGGGGTGGTGATG-3' 


88 


34 


Region 2 


5'- CTCCAAAAAACTATAAATACCCRAA-3' 


89 


35 


GM2 


5'- GAGTGAGTGAAGGYGGTAGATT-3' 


90 


36 


Region 3 


5 l - A ACCTC AC ATT AACRCTCCTA AA-3 ' 


91 


37 


GM3 


5'- GTTTTTTTAAGATTGGGTTTTTTTAG-3 ' 


92 


38 


Region 4 


5'- CAAACCCCAAACATCCTTTATCCA-3' 


93 


39 


GM5 


5'- GGATTTAGGGGTAAGGGGAGGG-3 * 


94 


40 


Region 5 


5'- AAAAACCACAACT AAAATCCRATT-3 ' 


95 


41 


GM1 


5'- AGTGAGGGATTTAGTTGTGGTGTG-3' 


96 


42 


Region 6 


5 1 - AACTATCRCCAACRCCACAA-3 1 


97 


43 


GM6 


5'- AAGAGATTTTTTTTTTTTTTTTTYGT-3 1 


98 


44 


Region 7 


5'- AAAATCCRAA AAAAAACRCCCCC-3 ' 


99 


45 


GM4 


5'- GGAAGTTTTAGGGGYGTAGGGGAA-3 1 


100 


46 


Region 8 


5'- AAC AAAATACAACTCCCAAACACCC-3 ' 


101 


47 




5'- TTAGGGTTTGATTTTTTAATTTGGTT-3' 


102 


48 


Region 8 


5'- CAAAAAATTACRATCCCCCCTC-3 ' 


103 


49 




5 L TTGGAGGTATAATAAGGAGATTTYGG- 
3' 


104 


50 
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TABLE 4 Targets for Bisulfite-PCR Primers 



Genes 




SEQ ID 
NO 


Corresponding 
Primer 

QTJpv m XT P* 


APOB 5 -target 


C» A A A A A A PPP A A A PT A P A A A A A P "3 5 


55 


1 


3 '-target 


c» p TpHrp^rT^p pt> PTTP^TTfim? Q ' 
j Cr 1 1 Lr 1 1 CCKCr 1 lul 1 Ovjriv -d 


DO 


o 
z 


Ko 5 -target 


J - AAL 1 A1LY LLAAL Y LLALAA -J 


5 / 


3 


3 '-target 


O A A P A P A ' 1 " 1 " 1 " 1 " 1 " 1 " i " 1 " i " 1 " t " 1 " 1 " 1 " 1 " 1 " 1 '*D P T OJ 

j - AACACA1 111111111111111 1K.Lt! -3 


<o 

JO 




-K/ j -target 


ci AAA ATPfVA A A A A A A A PVPPPPP Q t 

J - AAAA 1 LL Y AAAAAAAAL 1 LLLLL —3 


jy 




3 '-target 


ci p p a a p t' i " t " r Apr pp. o p_t A p p p p a a 
j - CCAAG 1111 ACCCCrKLr 1 ACrCCCAA -3 


£Lf\ 

ou 


0 


LDaz 5 -target 


C ) A A P"V A T^/^/^/~^nrO/^/^ r F l /^ 1, T 1 A A PPT A P O » 

j - AALYAiLLLlLLLlUlAALLlAL -3 


ol 


o 


3 '-target 


c ■> APPTAPTA TrriTH A flfiTTTPTTTT 1. ' 

j - AOC 1 Ad 1 A 1 LrC 1 CrAOC 111C1111 -3 


/CO 

oZ 


o 


HCrK. 3 -target 


f> A TP A ATA PT AAA PT> AAA TP AAA 1 ' 

J - Al LAA 1 At 1 AAALKAAA 1 LAAA -3 


o3 


Q 

7 


3 '-target 


c 5 a PP AAAAPAAA PP-T A A PiPiP 1 » 

J - AuuAAAALrAAAuu 1 /Y/YLjCC -3 


04 


1 n 

1U 


T?D\T1 O ~.^- 
.rl3INl 5 -target 


< ' P A A A A TT A A PO PAATAAAAAAA Q ' 

3 - LAAAAi 1 AALKLAA 1 AAAAAAA -3 


OJ 


1 1 
1 1 


3 '-target 


< ' HP A 'IT'TT A A P A PPTPPPPiA A A Q ' 

J - 1 A 1 1 1 UAACACrCr 1 CrCCCAAA -3 


oo 


1 0 
1Z 


GPRS 7 5 -target 


C» A A A PTrtTTP A /^PPP A PPT A A PP O ? 

5 - AAAC 1 CI 1 ALLLLALL 1 AAtC -3 


6/ 


13 


3 '-target 


C» PPTTTPT A A TTPP A TT A A A A/"/" 1 TIP Q ' 

5 - CjCjI 1 1G1AA1 ICCrAl IAAAYCjI 1 -3 


/CO 
Oo 


14 


t TOT1 A ^ C 4_„ „ , _ 4. 

HSPA6 5 -target 


C » PP A PT A A PTP AAAA PT AAAAAA 1 ' 

5 - LCAL 1 AAL 1 LAAAAt 1 AAAAAA —3 




1 j 


3 '-target 


C PPP A PPTPT A A A A PP A TP AAA O ' 

J - CCCrACrC 1 Cr 1 AAAAOCA 1 OAAA —3 


/U 


1 /C 
10 


IQGAP2 5 -target 


C ' PT 1 A A P A PT AAAATAAAAATAAA O > 

5 - CI AAC AC 1 AAAA 1 AAAAA 1 AAA —3 


O 1 

/ 1 


1 o 
1 / 


3 '-target 


C» P T A P P A TP ' f" 1 ' A T A VP A APAP Q ' 

5 - GIACjCtAICjI 1 Al AYCrAAUAC — 3 


oo 


1 Q 
15 


K_L 5 -target 


r) AAA PT> PT A A PD A A P A T A PT A P 1 ' 

j - AAACKC 1 AACKAACA 1 AC 1 AC —3 


Id 


1 Q 

iy 


3 '-target 


r 5 PPPTTTTTTTT A PPPT A TTT I 7 

D - CCC 11111111 ACCC 1 A 1 1 1 -3 


1 A 

/4 


on 
zu 


rAKi 5 -target 


C. j P A A TT AAA TTTP AAAAAA PPD 1 * 

3 - Cr AA 1 1 AAA 111 C A AAA AAC Civ- 3 


o< 


01 
Zl 


3' -target 


c 5 TTT A PP, A PP, A TPVPP A PJTT 1 ' 

J - i 1 1 AC Cr AC Cr A ICY uuAu 11-3 


O/C 

/o 


oo 

zz 


PTTV9 V tarcr<=»t 
J: 1 J. J -large L 


V- A A A A A APPTA APT* A A APAPTTA -V 


77 


23 


3 '-target 


5'- GTTATTGTGTAGTGGAGTTTGG -3' 


78 


24 


PTCHA 5 '-target 


5'- ACTCCRATTAACAAACCAAC -3' 


79 


25 


3 '-target 


5'- AATATGGTTTYGGTTGGTAA -3' 


80 


26 


PTCHB 5 5 -target 


5'- TCCCTAAATTCCACACATT -3' 


81 


27 


3 '-target 


5'- GTAAGTTGTAGTTGGTTGTTTTA -3' 


82 


28 


SDC1 5'-target 


5'- CTCTCTACTACCRAATTCCTCT -3' 


83 


29 


3 '-target 


5'- GTTTTGGTTTTGGTTGTG -3' 


84 


30 


SDC4 5 '-target 


5'- CCACTACCAAACAAATCCCC -3' 


85 


31 


3 '-target 


5'- TTTATTGGGGAATTTCGGG -3' 


86 


32 
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Table 5 New genes differentially methylated in disease versus normal issue 



Gene 


uene name 


IVTap 


TTninana 

unigene 
Entry 1 


IVTpfhvlnf-pri Tn 2 
i^itJiiiy laicu M.BL 


APOB 


Apolipoprotein B 


2p24 


Hs.585 


Common Tumors 


CACNAIG 


T-type calcium 


17 








channel 








CDX2 


Caudal type homeo 


13ql2,3 


Hs.77399 


Leukemias, breast, 




box transcription 






prostate 




factor 2 








EGFR 


Epidermal Growth 


7pl2 


Hs.77432 


Leukemias, breast 




Factor Receptor 








FBN1 


Fibrillin- 1 


15q21.1 


Hs.750 


Colon, Breast, prostate, 










leukemias 


GPR37 


G protein-coupled 


7q31 


Hs.27747 


colon, breast, leukemias 




receptor 37 








HSPA6 


Heat shock 70kD 


lq 


Hs.3268 


Common tumors 




protein 6 (HSP70B 5 ) 








IQGAP2 


RasGAP-related 


5q 


Hs.78993 


Common tumors 




protein 








KL 


Klotho 




Hs.94592 


Common tumors 


PARI 


Proteinase-activated 


5ql3 


Hs. 15429 


Leukemias, breast 




receptor 2 




9 




PITX2 


Paired-like 


4q25-27 


Hs.92282 


Leukemias, prostate, 




homeodomain 






breast 




transcription factor 
2 








PTCH 3 


Patched 


9Q31 


Hs. 15952 
6 


Leukemias 


SDC1 


Syndecan 1 


2p24.1 


Hs.82109 


Leukemias 


SDC4 


Syndecan 4 


20ql2 


Hs.72082 


Leukemias 



Table 5: New genes differentially methylated in cancer and other diseases. 

1 Gene database that can be found at http://www.ncbi.nlm.nih.gov/UniGene/index.html. 

2 Examples: List is not comprehensive. 

3 Two promoters are affected. 



WO 01/019845 



PCT/US00/25479 



In another embodiment, detection of differential methylation is accomplished 
by contacting a nucleic acid sample suspected of comprising a CpG-containing 
nucleic acid with a methylation sensitive restriction endonuclease that cleaves only 
unmethylated CpG sites under conditions and for a time to allow cleavage of 

5 unmethylated nucleic acid. The sample is further contacted with an isoschizomer of 
the methylation sensitive restriction endonuclease, that cleaves both methylated and 
unmethylated CpG-sites, under conditions and for a time to allow cleavage of 
methylated nucleic acid. Oligonucleotides are added to the nucleic acid sample under 
conditions and for a time to allow ligation of the oligonucleotides to nucleic acid 

10 cleaved by the restriction endonuclease, and the digested nucleic acid is amplified by 
conventional methods such as PCR wherein primers complementary to the 
oligonucleotides are employed. Following identification, the methylated CpG- 
containing nucleic acid can be cloned, using method well known to one of skill in the 
art (see Sambrook et al., Molecular Cloning: a Laboratory Manual , Cold Spring 

15 Harbor Press, 1989). 

As used herein, a "methylation sensitive restriction endonuclease" is a 
restriction endonuclease that includes CG as part of its recognition site and has altered 
activity when the C is methylated as compared to when the C is not methylated. 

20 Preferably, the methylation sensitive restriction endonuclease has inhibited activity 
when the C is methylated (e.g., Smal). Specific non-limiting examples of a 
methylation sensitive restriction endonucleases include Sma I, BssHII, or Hpall. Such 
enzymes can be used alone or in combination. Other methylation sensitive restriction 
endonucleases will be known to those of skill in the art and include, but are not 

25 limited to SacII, EagI, and BstUI, for example. An "isoschizomer" of a methylation 
sensitive restriction endonuclease is a restriction endonuclease which recognizes the 
same recognition site as a methylation sensitive restriction endonuclease but which 
cleaves both methylated and unmethylated CGs. One of skill in the art can readily 
determine appropriate conditions for a restriction endonuclease to cleave a nucleic 

30 acid (see Sambrook et al., Molecular C lonin g: a Laboratory Manual . Cold Spring 
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Harbor Press, 1989). Without being bound by theory, actively transcribed genes 
generally contain fewer methylated CGs than in other genes. 

In one embodiment of the invention, a nucleic acid of interest is cleaved with a 
5 methylation sensitive endonuclease. In one aspect, cleavage with the methylation 
sensitive endonuclease creates a sufficient overhang on the nucleic acid of interest. 
Following cleavage with the isoschizomer, the cleavage product can still have a 
sufficient overhang. An ,l overhang ,, refers to nucleic acid having two strands wherein 
the strands end in such a maimer that a few bases of one strand are not base paired to 
10 the other strand. A "sufficient overhang" refers to an overhang of sufficient length to 
allow specific hybridization of an oligonucleotide of interest. In one embodiment, a 
sufficient overhang is at least two bases in length. In another embodiment, the 
sufficient overhang is four or more bases in length. An overhang of a specific 
sequence on the nucleic acid of interest may be desired in order for an oligonucleotide 
15 of interest to hybridize. In this case, the isoschizomer can be used to create the 
overhang having the desired sequence on the nucleic acid of interest. 

In another aspect of this embodiment, the cleavage with a methylation 
sensitive endonuclease results in a reaction product of the nucleic acid of interest that 
20 has a blunt end or an insufficient overhang. In this embodiment, an isoschizomer of 
the methylation sensitive restriction endonuclease can create a sufficient overhang on 
the nucleic acid of interest. "Blunt ends" refers to a flush ending of two stands, the 
sense stand and the antisense strand, of a nucleic acid. 

25 Once a sufficient overhang is created on the nucleic acid of interest, an 

oligonucleotide is ligated to the nucleic acid cleaved of interest which has been 
cleaved by the methylation specific restriction endonuclease. "Ligation" is the 
attachment of two nucleic acid sequences by base pairing of substantially 
complementary sequences and/or by the formation of covalent bonds between two 

30 nucleic acid sequences. In one aspect of the present invention, an "oligonucleotide" is 
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a nucleic acid sequence of about 2 up to about 40 bases in length. It is presently 
preferred that the oligonucleotide is from about 15 to 35 bases in length. 

In one embodiment, an adaptor is utilized to create DNA ends of desired 

5 sequence and overhang. An "adaptor" is a double-stranded nucleic acid sequence with 
one end that has a sufficient single-stranded overhang at one or both ends such that 
the adaptor can be ligated by base-pairing to a sufficient overhang on a nucleic acid of 
interest that has been cleaved by a methylation sensitive restriction enzyme or an 
isoschizomer of a methylation sensitive restriction enzyme. Adaptors can be obtained 

10 commercially, or two oligonucleotides can be utilized to form an adaptor. Thus, in 
one embodiment, two oligonucleotides are used to form an adaptor; these 
oligonucleotides are substantially complementary over their entire sequence except for 
the region(s) at the 5' and/or 3' ends that will form a single stranded overhang. The 
single stranded overhang is complementary to an overhang on the nucleic acid cleaved 

15 by a methylation sensitive restriction enzyme or an isoschizomer of a methylation 

sensitive restriction enzyme, such that the overhang on the nucleic acid of interest will 
base pair with the 3' or 5 ? single stranded end of the adaptor under appropriate 
conditions. The conditions will vary depending on the sequence composition (GC vs 
AT), the length, and the type of nucleic acid (see Sambrook et ah, Molecular Cloning : 

20 a Laboratory Manual . 2nd Ed.; Cold Spring Harbor Laboratory Press, Plainview, NY, 
1998). 

Following the ligation of the oligonucleotide, the nucleic acid of interest is 
amplified using a primer complementary to the oligonucleotide. Specifically, the term 

25 "primer" as used herein refers to a sequence comprising two or more deoxyribo- 

nucleotides or ribonucleotides, preferably more than three, and more preferably more 
than eight, wherein the sequence is capable of initiating synthesis of a primer 
extension product, which is substantially complementary to a nucleic acid such as an 
adaptor or a ligated oligonucleotide. Environmental conditions conducive to synthesis 

30 include the presence of nucleoside triphosphates and an agent for polymerization, 
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such as DNA polymerase, and a suitable temperature and pH. The primer is 
preferably single stranded for maximum efficiency in amplification, but may be 
double stranded. If double stranded, the primer is first treated to separate its strands 
before being used to prepare extension products. In one embodiment, the primer is an 
5 oligodeoxyribo-nucleotide. The primer must be sufficiently long to prime the 
synthesis of extension products in the presence of the inducing agent for 
polymerization. The exact length of primer will depend on many factors, including 
temperature, buffer, and nucleotide composition. The oligonucleotide primer 
typically contains 12-20 or more nucleotides, although it may contain fewer 
10 nucleotides. 

Primers of the invention are designed to be "substantially" complementary to 
each strand of the oligonucleotide to be amplified and include the appropriate G or C 
nucleotides as discussed above. This means that the primers must be sufficiently 
15 complementary to hybridize with their respective strands under conditions which 
allow the agent for polymerization to perform. In other words, the primers should 
have sufficient complementarity with a 5 f and 3' oligonucleotide to hybridize 
therewith and permit amplification of CpG containing nucleic acid sequence. 

20 Primers of the invention are employed in the amplification process which is an 

enzymatic chain reaction that produces exponential quantities of target locus relative 
to the number of reaction steps involved (e.g., polymerase chain reaction or PGR). 
Typically, one primer is complementary to the negative (-) strand of the locus and the 
other is complementary to the positive (+) strand. Annealing the primers to denatured 

25 nucleic acid followed by extension with an enzyme, such as the large fragment of 
DNA Polymerase I (Klenow) and nucleotides, results in newly synthesized + and - 
strands containing the target locus sequence. Because these newly synthesized 
sequences are also templates, repeated cycles of denaturing, primer annealing, and 
extension results in exponential production of the region (i.e., the target locus 

30 sequence) defined by the primer. The product of the chain reaction is a discrete 
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nucleic acid duplex with termini corresponding to the ends of the specific primers 
employed. 

The oligonucleotide primers of the invention may be prepared using any 
suitable method, such as conventional phosphotriester and phosphodiester methods or 
5 automated embodiments thereof. In one such automated embodiment, diethylphos- 
phoramidites are used as starting materials and may be synthesized as described by 
Beaucage, et al. (Tetrahedron Letters, 22:1859-1862, 1981). One method for 
synthesizing oligonucleotides on a modified solid support is described in U.S. Patent 
No. 4,458,066. 

10 

Where the CpG-containing nucleic acid sequence of interest contains two 
strands, it is necessary to separate the strands of the nucleic acid before it can be used 
as a template for the amplification process. Strand separation can be effected either as 
a separate step or simultaneously with the synthesis of the primer extension products. 

15 This strand separation can be accomplished using various suitable denaturing 

conditions, including physical, chemical, or enzymatic means, the word "denaturing 11 
includes all such means. One physical method of separating nucleic acid strands 
involves heating the nucleic acid until it is denatured. Typical heat denaturation may 
involve temperatures ranging from about 80° to 105°C for times ranging from about 1 

20 to 10 minutes. Strand separation may also be induced by an enzyme from the class of 
enzymes known as helicases or by the enzyme RecA, which has helicase activity, and 
in the presence of riboATP, is known to denature DNA, The reaction conditions 
suitable for strand separation of nucleic acids with helicases are described by Kuhn 
Hoffmann-Berling (CSH-Quantitative Biology, 42:63, 1978) and techniques for using 

25 RecA are reviewed in C. Radding (Ann. Rev. Genetics, 16:405-437, 1982). 

When complementary strands of nucleic acid or acids are separated, regardless 
of whether the nucleic acid was originally double or single stranded, the separated 
strands are ready to be used as a template for the synthesis of additional nucleic acid 
30 strands. This synthesis is performed under conditions allowing hybridization of 
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primers to templates to occur. Generally synthesis occurs in a buffered aqueous 
solution, generally at a pH of about 7-9. Preferably, a molar excess (for genomic 
nucleic acid, usually about 108:1 primer:template) of the two oligonucleotide primers 
is added to the buffer containing the separated template strands. It is understood, 

5 however, that the amount of complementary strand may not be known if the process 
of the invention is used for diagnostic applications, so that the amount of primer 
relative to the amount of complementary strand cannot be determined with certainty. 
As a practical matter, however, the amount of primer added will generally be in molar 
excess over the amount of complementary strand (template) when the sequence to be 

10 amplified is contained in a mixture of complicated long-chain nucleic acid strands, a 
large molar excess is preferred to improve the efficiency of the process. 

The deoxyribonucleoside triphosphates dATP, dCTP, dGTP, and dTTP are 
added to the synthesis mixture, either separately or together with the primers, in 

15 adequate amounts and the resulting solution is heated to about 90°-100°C from about 
1 to 10 minutes, preferably from 1 to 4 minutes. After this heating period, the 
solution is allowed to cool to approximately room temperature, which is preferable for 
the primer hybridization. To the cooled mixture is added an appropriate agent for 
effecting the primer extension reaction (called herein "agent for polymerization"), and 

20 the reaction is allowed to occur under conditions known in the art. The agent for 

polymerization may also be added together with the other reagents if it is heat stable. 
This synthesis (or amplification) reaction may occur at room temperature up to a 
temperature above which the agent for polymerization no longer functions. Thus, for 
example, if DNA polymerase is used as the agent, the temperature is generally no 

25 greater than about 40°C. Most conveniently the reaction occurs at room temperature. 



The agent for polymerization may be any compound or system which will 
function to accomplish the synthesis of primer extension products, including enzymes. 
Suitable enzymes for this purpose include, for example, E. coli DNA polymerase I, 
30 Klenow fragment of E. coli DNA polymerase I, T4 DNA polymerase, other available 
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DNA polymerases, polymerase muteins, reverse transcriptase, and other enzymes, 
including heat-stable enzymes (i.e., those enzymes which perform primer extension 
after being subjected to temperatures sufficiently elevated to cause denaturation such 
as Taq DNA polymerase, and the like). Suitable enzymes will facilitate combination 
5 of the nucleotides in the proper manner to form the primer extension products which 
are complementary to each locus nucleic acid strand. Generally, the synthesis will be 
initiated at the 3' end of each primer and proceed in the 5' direction along the template 
strand, until synthesis terminates, producing molecules of different lengths. There 
may be agents for polymerization, however, which initiate synthesis at the 5* end and 
10 proceed in the other direction, using the same process as described above. 

Preferably, the method of amplifying is by PGR, as described herein and as is 
commonly used by those of ordinary skill in the art. However, alternative methods of 
amplification have been described and can also be employed. PGR techniques and 
15 many variations of PGR are known. Basic PGR techniques are described by Saiki et 
al. (1988 Science 239:487-491) and by U.S. Pat Nos. 4,683,195, 4,683,202 and 
4,800,159, which are incorporated herein by reference. 

The conditions generally required for PGR include temperature, salt, cation, 
20 pH and related conditions needed for efficient copying of the master-cut fragment. 
PGR conditions include repeated cycles of heat denaturation (i.e. heating to at least 
about 95. degree. C.) and incubation at a temperature permitting primer: adaptor 
hybridization and copying of the master-cut DNA fragment by the amplification 
enzyme. Heat stable amplification enzymes like the pwo, Thermus aquaticus or 
25 Thermococcus litoralis DNA polymerases are commercially available which eliminate 
the need to add enzyme after each denaturation cycle. The salt, cation, pH and related 
factors needed for enzymatic amplification activity are available from commercial 
manufacturers of amplification enzymes. 

30 As provided herein an amplification enzyme is any enzyme which can be used 

for in vitro nucleic acid amplification, e.g. by the above-described procedures. Such 
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amplification enzymes include pwo, Escherichia coli DNA polymerase I, Klenow 
fragment of E. coli DNA polymerase I, T4 DNA polymerase, T7 DNA polymerase, 
Thermus aquaticus (Taq) DNA polymerase, Thermococcus litoralis DNA polymerase, 
SP6 RNA polymerase, T7 RNA polymerase, T3 RNA polymerase, T4 polynucleotide 
5 kinase, Avian Myeloblastosis Virus reverse transcriptase, Moloney Murine Leukemia 
Virus reverse transcriptase, T4 DNA ligase, E. coli DNA ligase or Q.beta. replicase. 
Preferred amplification enzymes are the pwo and Taq polymerases. The pwo enzyme 
is especially preferred because of its fidelity in replicating DNA. 

10 Once amplified, the nucleic acid can be attached to a solid support, such as a 

membrane, and can be hybridized with any probe of interest, to detect any nucleic 
acid sequence. Several membranes are known to one of skill in the art for the 
adhesion of nucleic acid sequences. Specific non-limiting examples of these 
membranes include nitrocellulose (NITROPURE) or other membranes used in for 

15 detection of gene expression such as polyvinylchloride, diazotized paper and other 
commercially available membranes such as GENESCREEN, ZETAPROBE (Biorad), 
and NYTRAN Methods for attaching nucleic acids to these membranes are well 
known to one of skill in the art. Alternatively, screening can be done in a liquid 
phase. 

20 

In nucleic acid hybridization reactions, the conditions used to achieve a 
particular level of stringency will vary, depending on the nature of the nucleic acids 
being hybridized. For example, the length, degree of complementarity, nucleotide 
sequence composition (e.g., GC v. AT content), and nucleic acid type (e.g., RNA v. 
25 DNA) of the hybridizing regions of the nucleic acids can be considered in selecting 
hybridization conditions. An additional consideration is whether one of the nucleic 
acids is immobilized, for example, on a filter. 

An example of progressively higher stringency conditions is as follows: 2 x 
30 SSC/0.1% SDS at about room temperature (hybridization conditions); 0.2 x 
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SSC/0.1% SDS at about room temperature (low stringency conditions); 0.2 x 
SSC/0.1% SDS at about 42°C (moderate stringency conditions); and 0.1 x SSC at 
about 68°C (high stringency conditions). Washing can be carried out using only one 
of these conditions, e.g., high stringency conditions, or each of the conditions can be 
5 used, e.g., for 10-15 minutes each, in the order listed above, repeating any or all of the 
steps listed. However, as mentioned above, optimal conditions will vary, depending 
on the particular hybridization reaction involved, and can be determined empirically. 
In general, conditions of high stringency are used for the hybridization of the probe of 
interest. 

10 

The probe of interest can be detectably labeled, for example, with a 
radioisotope, a fluorescent compound, a bioluminescent compound, a 
chemiluminescent compound, a metal chelator, or an enzyme. Those of ordinary skill 
in the art will know of other suitable labels for binding to the probe, or will be able to 
15 ascertain such, using routine experimentation. 

In one embodiment, representational difference analysis (RDA, see Lisitsyn et 
al, Science 259:946-951, 1993, herein incorporated by reference) can be performed 
on CpG-containing nucleic acid following MCA. MCA utilizes kinetic and 

20 subtractive enrichment to purify restriction endonuclease fragments present in one 
population of nucleic acid fragments but not in another. Thus, RDA enables the 
identification of small differences between the sequences of two nucleic acid 
populations. RDA uses nucleic acid from one population as a "tester" and nucleic 
acid from a second population as a "driver" in order to clone probes for single copy 

25 sequences present in (or absent from) one of the two populations. In one embodiment, 
nucleic acid from a "normal" individual or sample, not having a disorder such as a 
cell-proliferative disorder is used as a "driver," and nucleic acid from an "affected" 
individual or sample, having the disorder such as a cell proliferative disorder is used 
as a "tester." In one embodiment, the nucleic acid used as a "tester" is isolated from 

30 an individual having a cell proliferative disorder such as low grade astrocytoma, 
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anaplastic astrocytoma, glioblastoma, medulloblastoma, gastric cancer, colorectal 
cancer, colorectal adenoma, acute myelogenous leukemia, leukemia, lung cancer, 
renal cancer, breast cancer, prostate cancer, endometrial cancer and neuroblastoma. 
The nucleic acid used as a "driver' 1 is thus normal astrocytes, normal glial cells, 
5 normal brain cells, normal gastric cells, normal colorectal cells, normal leukocytes, 
normal lung cells, normal kidney cells, normal breast cells, normal prostate cells, 
normal uterine cells, and normal neurons, respectively. In an additional embodiment, 
the nucleic acid used as a "driver" is isolated from an individual having a cell 
proliferative disorder such as low grade astrocytoma, anaplastic astrocytoma, 

10 glioblastoma, medulloblastoma, gastric cancer, colorectal cancer, colorectal adenoma, 
acute myelogenous leukemia, leukemia, lung cancer, renal cancer, breast cancer, 
prostate cancer, endometrial cancer and neuroblastoma. The nucleic acid used as a 
"tester" is thus normal astrocytes, normal glial cells, normal brain cells, normal gastric 
cells, normal colorectal cells, normal leukocytes, normal lung cells, normal kidney 

15 cells, normal breast cells, normal prostate cells, normal uterine cells, and normal 
neurons, respectively. One of skill in the art will readily be able to identify the 
"tester" nucleic acid useful with to identify methylated nucleic acid sequences in 
given "driver" population. 

20 KITS 

The materials for use in the assay of the invention are ideally suited for the 
preparation of a kit. Therefore, in accordance with another embodiment of the present 
invention, there is provided a kit it useful for the detection of a cellular proliferative 
disorder in a subject having or at risk for said cellular proliferative disorder. 

25 Invention kits include a carrier means compartmentalized to receive a sample in close 
confinement therein, one or more containers comprising a first container containing a 
reagent which modifies unmethylated cytosine and a second container containing 
primers for amplification of a CpG-containing nucleic acid, wherein the primers 
distinguish between modified methylated and nonmethylated nucleic acid, and 

30 optionally, a third container containing a methylation sensitive restriction 
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endonuclease. Primers contemplated for use in accordance with the invention include 
those set forth in SEQ ID NOs: 1-50. 

Carrier means are suited for containing one or more container means such as 
5 vials, tubes, and the like, each of the container means comprising one of the separate 
elements to be used in the method. In view of the description provided herein of 
invention methods, those of skill in the art can readily determine the apportionment of 
the necessary reagents among the containiner means. For example, one of the 
container means can comprise a container containing an oligonucleotide for ligation to 
10 nucleic acid cleaved by a methylation sensitive restriction endonuclease. One or more 
container means can also be included comprising a primer complementary to the 
oligonucleotide. In addition, one or more container means can also be included which 
comprise a methylation sensitive restriction endonuclease. One or more container 
means can also be included containing an isoschizomer of said methylation sensitive 
15 restriction enzyme. 

In another embodiment, the kit may comprise a carrier means containing one 
or more container means comprising a solid support, wherein the solid support has a 
nucleic acid sequence of CACNA1G as described herein immobilized on the solid 

20 support. In one embodiment, the solid support is a membrane. Several membranes 
are known to one of skill in the art for the adhesion of nucleic acid sequences. 
Specific non-limiting examples of these membranes include nitrocellulose (Nitropure) 
or other membranes used in for detection of gene expression such as 
polyvinylchloride, diazotized paper and other commercially available membranes 

25 such as GENES CREEN, ZETAPROBE (Biorad), and NYTRAN The CACNA1G 
sequences immobilized on the solid support can then be hybridized to nucleic acid 
sequences produced by performing the MCA procedure, bisulfite PGR or other 
methylation detection methods on the nucleic acids of a sample of interest in order to 
determine if the nucleic acid sequences contained in the sample are methylated. 
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The term "oligonucleotide primer" refers to a sequence of two or more 
deoxyribo-nucleotides or ribonucleotides, preferably at least eight, which sequence is 
capable of initiating synthesis of a primer extension product that is substantially 
complementary to a target nucleic acid strand. The oligonucleotide primer typically 

i 

5 contains fifteen to twenty-two or more nucleotides, although it may contain fewer 
nucleotides if the primer is complementary, so as to specifically allow the 
amplification of the specifically desired target nucleotide sequence. 

The oligonucleotide primers for use in the invention may be prepared using 
10 any suitable method, such as conventional phosphotriester and phosphodiester 

methods or automated embodiments thereof. In one such automated embodiment, 
diethyl-phos-phoramidites are used as starting materials and may be synthesized as 
described by Beaucage et al., Tetrahedron Letter, 22: 1859-1862 (1981). One method 
for synthesizing oligonucleotides on a modified solid support is described in U.S. 
15 Patent No. 4,458,066. The exact length of primer will depend on many factors, 

including temperature, buffer, and nucleotide composition. The primer must prime 
the synthesis of extension products in the presence of the inducing agent for 
amplification. 

20 Primers used according to the method of the invention are complementary to 

each strand of mutant nucleotide sequence to be amplified. The term 
"complementary" means that the primers must hybridize with their respective strands 
under conditions which allow the agent for polymerization to function. In other 
words, the primers that are complementary to the flanking sequences hybridize with 

25 the flanking sequences and permit amplification of the nucleotide sequence. 

Preferably, the 3 f terminus of the primer that is extended has perfectly base paired 
complementarity with the complementary flanking strand. 



30 



The term "flanks CpG-rich regions" refers to those DNA sequences on 
chromosome that are upstream (5') or downstream (3 ! ) to the DNA sequence to be 
amplified. The sequence to be amplified is preferably a CpG-rich region in a gene or 
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regulatory region associated with a gene. For example, when the nucleotide sequence 
to be amplified is double stranded, a first sequence that is 5' to the nucleotide 
sequence and a second sequence that is 5 1 to the nucleotide sequence on the 
complementary strand flank the CpG-rich DNA sequence. 

5 

The nucleotide sequences that flank nucleotide repeats, i.e., the nucleotide 
sequences to which the oligonucleotide primers hybridize, may be selected from 
among the following nucleotide sequences: SEQ ID NO: 1-50. 

10 In general, the primers used according to the method of the invention embrace 

oligonucleotides of sufficient length and appropriate sequence which provide specific 
initiation of polymerization of a significant number of nucleic acid molecules 
containing the target nucleic acid under the conditions of stringency for the reaction 
utilizing the primers. In this manner, it is possible to selectively amplify the specific 

15 target nucleic acid sequence containing the nucleic acid of interest. Oligonucleotide 
primers used according to the invention are employed in any amplification process 
that produces increased quantities of target nucleic acid. 

The invention includes antibodies immunoreactive with CACNA1G 
20 polypeptide or functional fragments thereof. Antibody which consists essentially of 
pooled monoclonal antibodies with different epitopic specificities, as well as distinct 
monoclonal antibody preparations are provided. Monoclonal antibodies are made 
from antigen containing fragments of the protein by methods well known to those 
skilled in the art (Kohler, et aL, Nature, 25^:495, 1975). The term antibody as used in 
25 this invention is meant to include intact molecules as well as fragments thereof, such 
as Fab and F(ab% Fv and SCA fragments which are capable of binding an epitopic 
determinant on CACNA1G . 

(1) An Fab fragment consists of a monovalent antigen-binding fragment of an 
30 antibody molecule, and can be produced by digestion of a whole antibody molecule 
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with the enzyme papain, to yield a fragment consisting of an intact light chain and a 
portion of a heavy chain. 

(2) An Fab f fragment of an antibody molecule can be obtained by treating a whole 

5 antibody molecule with pepsin, followed by reduction, to yield a molecule consisting 
of an intact light chain and a portion of a heavy chain. Two Fab' fragments are 
obtained per antibody molecule treated in this manner. 

(3) An (Fab') 2 fragment of an antibody can be obtained by treating a whole antibody 
10 molecule with the enzyme pepsin, without subsequent reduction. A (Fab') 2 fragment 

is a dimer of two Fab 1 fragments, held together by two disulfide bonds. 

(4) An Fv fragment is defined as a genetically engineered fragment containing the 
variable region of a light chain and the variable region of a heavy chain expressed as 

15 two chains. 

(5) A single chain antibody ("SCA") is a genetically engineered single chain molecule 
containing the variable region of a light chain and the variable region of a heavy 
chain, linked by a suitable, flexible polypeptide linker. 

20 

As used in this invention, the term "epitope" refers to an antigenic determinant 
on an antigen, such as a CACNA1G polypeptide, to which the paratope of an 
antibody, such as an CACNA1G -specific antibody, binds. Antigenic determinants 
usually consist of chemically active surface groupings of molecules, such as amino 
25 acids or sugar side chains, and can have specific three-dimensional structural 
characteristics, as well as specific charge characteristics. 

As is mentioned above, antigens that can be used in producing CACNA1G 
-specific antibodies include CACNA1G polypeptides or CACNA1G polypeptide 
30 fragments. The polypeptide or peptide used to immunize an animal can be obtained 
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by standard recombinant, chemical synthetic, or purification methods. As is well 
known in the art, in order to increase immunogenicity, an antigen can be conjugated to 
a carrier protein. Commonly used carriers include keyhole limpet hemocyanin 
(KLH), thyreoglobulin, bovine serum albumin (BSA), and tetanus toxoid. The coupled 
5 peptide is then used to immunize the animal (e.g., a mouse, a rat, or a rabbit). In 
addition to such carriers, well known adjuvants can be administered with the antigen 
to facilitate induction of a strong immune response. 

The invention also provides a method for detecting a cell proliferative disorder 
10 associated with CACNA1G in a subject, comprising contacting a target cellular 
component suspected of having a CACNA1G associated disorder, with a reagent 
which reacts with or binds to CACNA1G and detecting CACNA1G . The target cell 
component can be nucleic acid, such as DNA or RNA, or it can be protein. When the 
component is nucleic acid, the reagent is typically a nucleic acid probe or PGR primer. 
15 When the cell component is protein, the reagent is typically an antibody probe. The 
target cell component may be detected directly in situ or it may be isolated from other 
cell components by common methods known to those of skill in the art before 
contacting with a probe. (See for example, Maniatis, et aL, Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory, N.Y. 1989; Current Protocols in 
20 Molecular Biology, 1994, Ed. Ausubel, et al., Green Publ. Assoc. & Wiley 

Interscience.) Detection methods include Southern and Northern blot analyses, RNase 
protection, immunoassays and other detection assays that are known to those of skill 
in the art. 

25 The probes can be detectably labeled, for example, with a radioisotope, a 

fluorescent compound, a bioluminescent compound, a chemiluminescent compound, a 
metal chelator, or an enzyme. Those of ordinary skill in the art will know of other 
suitable labels for binding to the probes or will be able to ascertain such, using routine 
experimentation. 
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Since the present invention shows that a decreased level of CACNA1G 
transcription is often the result of hypermethylation of the CACNA1G gene, it is 
often desirable to directly determine whether the CACNA1G gene is 
hypermethylated. In particular, the cytosine rich areas terms "CpG islands" which lie 
5 in the 5 1 regulatory regions of genes are normally unmethylated. The term 
"hypermethylation" includes any methylation of cytosine which is normally 
unmethylated in the CACNA1G gene sequence can be detected by restriction 
endonuclease treatment of CACNA1G polynucleotide (gene) and Southern blot 
analysis for example. Therefore, in a method of the invention, when the cellular 

10 component detected is DNA, restriction endonuclease analysis is preferable to detect 
hypermethylation of the CACNA1G gene. Any restriction endonuclease that includes 
CG as part of its recognition site and that is inhibited when the C is methylated, can 
be utilized. Methylation sensitive restriction endonucleases such as BssHII, Mspl, 
NotI or Hpall, used alone or in combination are examples of such endonucleases. 

15 Other methylation sensitive restriction endonucleases will be known to those of skill 
in the art. In addition, PCR can be utilized to detect the methylation status of the 
CACNA1G gene. Oligonucleotide primers based on any coding sequence region in 
the CACNA1G sequence are useful for amplifying DNA by PCR. CACNA1G is 
described here for exemplary purposes. The other genes described herein as being 

20 For purposes of the invention, an antibody or nucleic acid probe specific for 

CACNA1G may be used to detect the presence of CACNA1G polypeptide (using 
antibody) or polynucleotide (using nucleic acid probe) in biological fluids or tissues. 
Oligonucleotide primers based on any coding sequence region in the CACNA1G 
sequence are useful for amplifying DNA, for example by PCR. Any specimen 

25 containing a detectable amount of CACNA1G polynucleotide or CACNA1G 

polypeptide antigen can be used. Nucleic acid can also be analyzed by RNA in situ 
methods which are known to those of skill in the art. A preferred sample of this 
invention is tissue of heart, renal, brain, colon, breast, urogenital, uterine, 
hematopoietic, prostate, thymus, lung, testis, and ovarian. Preferably the subject is 

30 human. 
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Various disorders which are detectable by the method of the invention include 
astrocytoma, anaplastic astrocytoma, glioblastoma, medulloblastoma, colon cancer, 
lung cancer, renal cancer, leukemia, breast cancer, prostate cancer, endometrial cancer 
and neuroblastoma. 

5 

Monoclonal antibodies used in the method of the invention are suited for use, 
for example, in immunoassays in which they can be utilized in liquid phase or bound 
to a solid phase carrier. In addition, the monoclonal antibodies in these immunoassays 
can be detectably labeled in various ways. Examples of types of immunoassays which 

10 can utilize monoclonal antibodies of the invention are competitive and non- 
competitive immunoassays in either a direct or indirect format. Examples of such 
immunoassays are the radioimmunoassay (RIA) and the sandwich (immunometric) 
assay. Detection of the antigens using the monoclonal antibodies of the invention can 
be done utilizing immunoassays which are run in either the forward, reverse, or 

15 simultaneous modes, including immunohistochemical assays on physiological 
samples. Those of skill in the art will know, or can readily discern, other 
immunoassay formats without undue experimentation. 

The term "immunometric assay" or "sandwich immunoassay", includes 
20 simultaneous sandwich, forward sandwich and reverse sandwich immunoassays. 
These terms are well understood by those skilled in the art. Those of skill will also 
appreciate that antibodies according to the present invention will be useful in other 
variations and forms of assays which are presently known or which may be developed 
in the future. These are intended to be included within the scope of the present 
25 invention. 



Monoclonal antibodies can be bound to many different carriers and used to 
detect the presence of CACNA1G . Examples of well-known carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
30 modified celluloses, polyacrylamides, agaroses and magnetite. The nature of the 
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carrier can be either soluble or insoluble for purposes of the invention. Those skilled 
in the art will know of other suitable carriers for binding monoclonal antibodies, or 
will be able to ascertain such using routine experimentation. 

5 In performing the assays it may be desirable to include certain "blockers" in 

the incubation medium (usually added with the labeled soluble antibody). The 
"blockers" are added to assure that non-specific proteins, proteases, or anti- 
heterophilic immunoglobulins to anti-CACNAlG immunoglobulins present in the 
experimental sample do not cross-link or destroy the antibodies on the solid phase 
10 support, or the radiolabeled indicator antibody, to yield false positive or false negative 
results. The selection of "blockers" therefore may add substantially to the specificity 
of the assays described in the present invention. 

It has been found that a number of nonrelevant (i.e., nonspecific) antibodies of 
15 the same class or subclass (isotype) as those used in the assays (e.g., IgGl, IgG2a, 

IgM, etc.) can be used as "blockers". The concentration of the "blockers" (normally 1- 
100 .mu.g/.mu.l) may be important, in order to maintain the proper sensitivity yet 
inhibit any unwanted interference by mutually occurring cross reactive proteins in the 
specimen. 

20 

In using a monoclonal antibody for the in vivo detection of antigen, the 
detectably labeled monoclonal antibody is given in a dose which is diagnostically 
effective. The term "diagnostically effective" means that the amount of detectably 
labeled monoclonal antibody is administered in sufficient quantity to enable detection 

25 of the site having the CACNA1G antigen for which the monoclonal antibodies are 
specific. The concentration of detectably labeled monoclonal antibody which is 
administered should be sufficient such that the binding to those cells having 
CACNA1G is detectable compared to the background. Further, it is desirable that the 
detectably labeled monoclonal antibody be rapidly cleared from the circulatory system 

30 in order to give the best target-to-background signal ratio. 
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As a rule, the dosage of detectably labeled monoclonal antibody for in vivo 
diagnosis will vary depending on such factors as age, sex, and extent of disease of the 
individual. The dosage of monoclonal antibody can vary from about 0.001 
mg/m.sup.2 to about 500 mg/m.sup.2, preferably 0.1 mg/m.sup.2 to about 200 
5 mg/m.sup.2, most preferably about 0.1 mg/m.sup.2 to about 10 mg/m.sup.2. Such 
dosages may vary, for example, depending on whether multiple injections are given, 
tumor burden, and other factors known to those of skill in the art. 

For in vivo diagnostic imaging, the type of detection instrument available is a 
10 major factor in selecting a given radioisotope. The radioisotope chosen must have al 
type of decay which is detectable for a given type of instrument. Still another 
important factor in selecting a radioisotope for in vivo diagnosis is that the half-life of 
the radioisotope be long enough so that it is still detectable at the time of maximum 
uptake by the target, but short enough so that deleterious radiation with respect to the 
15 host is minimized. Ideally, a radioisotope used for in vivo imaging will lack a particle 
emission, but produce a large number of photons in the 140-250 keV range, which 
may be readily detected by conventional gamma cameras. 

For in vivo diagnosis, radioisotopes may be bound to immunoglobulin either 
20 directly or indirectly by using an intermediate functional group. Intermediate 

functional groups which often are used to bind radioisotopes which exist as metallic 
ions to immunoglobulins are the bifanctional chelating agents such as 
diethylenetriaminepentactic acid (DTP A) and ethylenediaminetetraacetic acid 
(EDTA) and similar molecules. Typical examples of metallic ions which can be 
25 bound to the monoclonal antibodies of the invention are .sup.l 1 1 In, .sup.97 Ru, 
.sup.67 Ga, .sup.68 Ga, .sup.72 As, .sup.89 Zr, and .sup.201 Tl. 



30 



A monoclonal antibody useful in the method of the invention can also be 
labeled with a paramagnetic isotope for purposes of in vivo diagnosis, as in magnetic 
resonance imaging (MRI) or electron spin resonance (ESR). In general, any 
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conventional method for visualizing diagnostic imaging can be utilized. Usually 
gamma and positron emitting radioisotopes are used for camera imaging and 
paramagnetic isotopes for MRL Elements which are particularly useful in such 
techniques include .sup.157 Gd, .sup.55 Mn 5 .sup.162 Dy, .sup.52 Cr, and .sup.56 Fe. 

5 

The present invention also provides a method for treating a subject with a cell 
proliferative disorder associated with of CACNA1G comprising administering to a 
subject with the disorder a therapeutically effective amount of reagent which 
modulates CACNA1G expression. In brain, breast and renal cancer cells, for 

10 example, the CACNA1G nucleotide sequence is under-expressed as compared to 
expression in a normal cell, therefore, it is possible to design appropriate therapeutic 
or diagnostic techniques directed to this sequence. Thus, where a cell-proliferative 
disorder is associated with the expression of CACNA1G associated with malignancy, 
nucleic acid sequences that modulate CACNA1G expression at the transcriptional or 

15 translational level can be used. In cases when a cell proliferative disorder or abnormal 
cell phenotype is associated with the under expression of CACNA1G , for example, 
nucleic acid sequences encoding CACNA1G (sense) could be administered to the 
subject with the disorder. 

20 The term "cell-proliferative disorder" denotes malignant as well as non- 

malignant cell populations which often appear to differ from the surrounding tissue 
both morphologically and genotypically. Such disorders may be associated, for 
example, with absence of expression of CACNA1G . Essentially, any disorder which 
is etiologically linked to expression of CACNA1G could be considered susceptible to 

25 treatment with a reagent of the invention which modulates CACNA1G expression. 



The term "modulate" envisions the suppression of methylation of CACNA1G 
polynucleotide when CACNA1G is under-expressed. When a cell proliferative 
disorder is associated with CACNA1G expression, such methylation suppressive 
30 reagents as 5-azacytadine can be introduced to a cell. Alternatively, when a cell 
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proliferative disorder is associated with under-expression of CACNA1G polypeptide, 
a sense polynucleotide sequence (the DNA coding strand) encoding CACNA1G 
polypeptide, or 5 f regulatory nucleotide sequences (i.e., promoter) of CACNA1G in 
operable linkage with CACNA1G polynucleotide can be introduced into the cell. 
5 Demethylases known in the art could also be used to remove methylation. 

The present invention also provides gene therapy for the treatment of cell 
proliferative disorders which are mediated by CACNA1G . Such therapy would 
achieve its therapeutic effect by introduction of the appropriate CACNA1G 
10 polynucleotide which contains a CACNA1G structural gene (sense), into cells of 
subjects having the proliferative disorder. Delivery of sense CACNA1G 
polynucleotide constructs can be achieved using a recombinant expression vector such 
as a chimeric virus or a colloidal dispersion system. 

15 The polynucleotide sequences used in the method of the invention may be the 

native, unmethylated sequence or, alternatively, may be a sequence in which a 
nonmethylatable analog is substituted within the sequence. Preferably, the analog is a 
nonmethylatable analog of cytidine, such as 5-azacytadine. Other analogs will be 
known to those of skill in the art. Alternatively, such nonmethylatable analogs could 

20 be administered to a subject as drug therapy, alone or simultaneously with a sense 

structural gene for CACNA1G or sense promoter for CACNA1G operably linked to 
CACNA1G structural gene. 

In another embodiment, a CACNA1G structural gene is operably linked to a 
25 tissue specific heterologous promoter and used for gene therapy. For example, a 

CACNA1G gene can be ligated to prostate specific antigen (PSA)-prostate specific 
promoter for expression of CACNA1G in prostate tissue. Other tissue specific 
promoters will be known to those of skill in the art. Alternatively, the promoter for a 
tumor suppressor gene can be linked to the CACNA1G structural gene and used for 
30 gene therapy. 
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Various viral vectors which can be utilized for gene therapy as taught herein 
include adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a 
retrovirus. Preferably, the retroviral vector is a derivative of a murine or avian 
retrovirus. Examples of retroviral vectors in which a single foreign gene can be 
5 inserted include, but are not limited to: Moloney murine leukemia virus (MoMuLV), 
Harvey murine sarcoma virus (HaMuSV), murine mammary tumor virus (MuMTV), 
and Rous Sarcoma Virus (RSV). Most preferably, a non-human primate retroviral 
vector is employed, such as the gibbon ape leukemia virus (GaLV), thereby providing 
a broader host range than murine vectors, for example. 

10 

A number of additional retroviral vectors can incorporate multiple genes. All 
of these vectors can transfer or incorporate a gene for a selectable marker so that 
transduced cells can be identified and generated. Retroviral vectors can be made target 
specific by inserting, for example, a polynucleotide encoding a sugar, a glycolipid, or 
15 a protein. Preferred targeting is accomplished by using an antibody to target the 
retroviral vector. Those of skill in the art will know of, or can readily ascertain 
without undue experimentation, specific polynucleotide sequences which can be 
inserted into the retroviral genome to allow target specific delivery of the retroviral 
vector containing the CACNA1G sense or antisense polynucleotide. 

20 

Since recombinant retroviruses are defective, they require assistance in order 
to produce infectious vector particles. This assistance can be provided, for example, 
by using helper cell lines that contain plasmids encoding all of the structural genes of 
the retrovirus under the control of regulatory sequences within the LTR. These 
25 plasmids are missing a nucleotide sequence which enables the packaging mechanism 
to recognize an RNA transcript for encapsidation. Helper cell lines which have 
deletions of the packaging signal include but are not limited to .PSI.2, PAS 17 and 
PA12, for example. These cell lines produce empty virions, since no genome is 
packaged. If a retroviral vector is introduced into such cells in which the packaging 
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signal is intact, but the structural genes are replaced by other genes of interest, the 
vector can be packaged and vector virion produced. 

Another targeted delivery system for CACNA1G polynucleotide is a colloidal 
5 dispersion system. Colloidal dispersion systems include macromolecule complexes, 
nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water 
emulsions, micelles, mixed micelles, and liposomes. The preferred colloidal system of 
this invention is a liposome. Liposomes are artificial membrane vesicles which are 
useful as delivery vehicles in vitro and in vivo. It has been shown that large 

10 unilamellar vesicles (LUV), which range in size from 0.2-4.0 um can encapsulate a 
substantial percentage of an aqueous buffer containing large macromolecules. RNA, 
DNA and intact virions can be encapsulated within the aqueous interior and be 
delivered to cells in a biologically active form (Fraley, et al., Trends Biochem. Sci., 
6:77, 1981). In addition to mammalian cells, liposomes have been used for delivery of 

15 polynucleotides in plant, yeast and bacterial cells. In order for a liposome to be an 
efficient gene transfer vehicle, the following characteristics should be present: (1) 
encapsulation of the genes of interest at high efficiency while not compromising their 
biological activity; (2) preferential and substantial binding to a target cell in 
comparison to non-target cells; (3) delivery of the aqueous contents of the vesicle to 

20 the target cell cytoplasm at high efficiency; and (4) accurate and effective expression 
of genetic information (Mannino, et al., Biotechniques, 6:682, 1988). 

The composition of the liposome is usually a combination of phospholipids, 
particularly high-phase-transition-temperature phospholipids, usually in combination 
25 with steroids, especially cholesterol. Other phospholipids or other lipids may also be 
used. The physical characteristics of liposomes depend on pH, ionic strength, and the 
presence of divalent cations. 

Examples of lipids useful in liposome production include phosphatidyl 
30 compounds, such as phosphatidylglycerol, phosphatidylcholine, phosphatidylserine, 
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phosphatidylethanolamine, sphingolipids, cerebrosides, and gangliosides. Particularly 
useful are diacylphosphatidylglycerols, where the lipid moiety contains from 14-18 
carbon atoms, particularly from 16-18 carbon atoms, and is saturated. Illustrative 
phospholipids include egg phosphatidylcholine, dipalmitoylphosphatidylcholine and 
5 distearoylphosphatidylcholine. 

The targeting of liposomes has been classified based on anatomical and 
mechanistic factors. Anatomical classification is based on the level of selectivity, for 
example, organ-specific, cell-specific, and organelle-specific. Mechanistic targeting 

10 can be distinguished based upon whether it is passive or active. Passive targeting 
utilizes the natural tendency of liposomes to distribute to cells of the reticulo- 
endothelial system (RES) in organs which contain sinusoidal capillaries. Active 
targeting, on the other hand, involves alteration of the liposome by coupling the 
liposome to a specific ligand such as a monoclonal antibody, sugar, glycolipid, or 

15 protein, or by changing the composition or size of the liposome in order to achieve 
targeting to organs and cell types other than the naturally occurring sites of 
localization. 

The surface of the targeted delivery system may be modified in a variety of 
20 ways. In the case of a liposomal targeted delivery system, lipid groups can be 

incorporated into the lipid bilayer of the liposome in order to maintain the targeting 
ligand in stable association with the liposomal bilayer. Various linking groups can be 
used for joining the lipid chains to the targeting ligand. 

25 In general, the compounds bound to the surface of the targeted delivery system 

will be ligands and receptors which will allow the targeted delivery system to find and 
"home in" on the desired cells. A ligand may be any compound of interest which will 
bind to another compound, such as a receptor. 

30 
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In general, surface membrane proteins which bind to specific effector 
molecules are referred to as receptors. In the present invention, antibodies are 
preferred receptors. Antibodies can be used to target liposomes to specific cell-surface 
ligands. For example, certain antigens expressed specifically on tumor cells, referred 
5 to as tumor-associated antigens (TAAs), may be exploited for the purpose of targeting 
CACNA1G antibody-containing liposomes directly to the malignant tumor. Since the 
CACNA1G gene product may be indiscriminate with respect to cell type in its action, 
a targeted delivery system offers a significant improvement over randomly injecting 
non-specific liposomes. Preferably, the target tissue is human brain, colon, breast, 

10 lung, and renal origin. A number of procedures can be used to covalently attach either 
polyclonal or monoclonal antibodies to a liposome bilayer. Antibody-targeted 
liposomes can include monoclonal or polyclonal antibodies or fragments thereof such 
as Fab, or F(ab0.sub.2, as long as they bind efficiently to an antigenic epitope on the 
target cells. Liposomes may also be targeted to cells expressing receptors for 

15 hormones or other serum factors. 

It should be noted that as used herein and in the appended claims, the singular 
forms "a," "and," and "the" include plural referents unless the context clearly dictates 
otherwise. Thus, for example, reference to "a cell" includes a plurality of such cells 
20 and reference to "the restriction enzyme" includes reference to one or more restriction 
enzymes and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have 
the same meaning as commonly understood to one of ordinary skill in the art to which 
25 this invention belongs. Although any methods, devices and materials similar or 
equivalent to those described herein can be used in the practice or testing of the 
invention, the preferred methods, devices and materials are now described. 



30 



All publications mentioned herein are incorporated herein by reference in full 
for the purpose of describing and disclosing the methodologies which are described in 
the publications which might be used in connection with the presently described 
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invention. The publications discussed above and throughout the text are provided 
solely for their disclosure prior to the filing date of the present application. Nothing 
herein is to be construed as an admission that the inventors are not entitled to antedate 
such disclosure by virtue of prior invention. 

5 

The following example is intended to illustrate but not to limit the invention in 
any manner, shape, or form, either explicitly or implicitly. While they are typical of 
those that might be used, other procedures, methodologies, or techniques known to 
those skilled in the art may alternatively be used. 

10 

EXAMPLE 1 

IDENTIFICATION OF CACNA1G AS A TARGET FOR 
HYPERMETHYLATION ON HUMAN CHROMOSOME 17q21 

15 In order to isolate genes differentially methylated in cancer cells as opposed to 

non-cancerous cells the following experimental protocols were used. An example of 
the results obtained is provided hereinabove in the description of the isolation and 
characterization of human CANNA1G. 

20 Tissue Samples and Cell Lines 

Forty-nine primary colorectal cancers, 28 colorectal adenomas, 16 primary 
gastric cancers and 17 acute myelogenous leukemia samples were used for 
methylation analyses. DNA from eight colon cancer cell lines (Caco2, RKO, SW48, 
HCT1 16, DLD1, Lovo, SW837, HT29), 4 lung cancer cell lines (OH3, H249, H157, 

25 H209), 4 glioblastoma cell fines (Dauy, D283, U87, U373), 8 breast cancer cell lines 
(MB-468, MCF7, MB-231, MB-474, MB-435, MB-453, BT20, CAMA1, SKBR3), 7 
hematopoietic tumor cell lines (CEM, Raji, KG1A, HL60, ML-l,MoIt3, K562), and 4 
prostate cancer cell lines (DU145, DUPRO, LNCAP, TSUPRL) were also 
investigated. DNA was extracted by standard procedures. RNA was isolated from 

30 cell lines and adenomas using TRIZOL (GIBCO-BRL). For re-expression analysis, 
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cell lines were treated with 5-Aza-deoxycytidine (SIGMA) at a final concentration of 
1 M for 6 days. All tissue samples were obtained from patients who gave informed 
consent according to institutional guidelines. 

5 RT-PCR. 

Six g of total RNA, was reverse transcribed using the SUPERSCRIPT kit 
(GIBCO-BRL) for first strand cDNA synthesis. One hundred ng of cDNA was used 
as template for RT-PCR reactions. To design the RT-PCR primers. Blast search was 
performed using the rat CacnalG cDNA sequence (Genbank AF027984) reported 

10 previously (25) and exon-intron boundaries of the human CACNA1G were predicted 
by this analysis. Each primer set was designed to amplify the cDNA across several 
introns. Primer sequences and PCR conditions are available, at 
http://www.med.jhu.edxi/methylation/primers. GAPDH was also amplified as a 
control using primers GAPDHF: 5 '-CGGAGTCAACGGATTGGTCGTAT-3 5 (SEQ 

15 ID NO:53) and GAPDHR: 5 ' - AGCCTTCTCC ATGGTGGTGAAGAC-3 9 (SEQ ID 
NO:54). All reactions were performed with RT(-) controls. PCR amplification was 
performed for 35 cycles of 95° C 30 see, 60-65°C for 30 sec, 72°C for 30 sec 5 and the 
products were analyzed by agarose gel electrophoresis. 

20 DNA Sequencing and Data Analysis. 

PCR reaction products were precipitated with ethanol, resuspended in diluted 
water and cloned into the pCR2.1 vector using the TA cloning kit (Invitrogen) 
according to the manufacturer's instruction. After transformation, plasmid DNA was 
purified using the Wizard Miniprep Kit (Promega). DNA sequence analysis was 

25 carried out at the Johns Hopkins University Sequence Facility using automated DNA 
sequencers (Applied Biosystems). Sequence homology was identified by the BLAST 
program of the National Center for Biological Information (NCBI) available at 
http://www.ncbi.nlm.nih.gov/ BLAST/. An IMAGE cDNA clone (Genbank: H13333) 
was identified by BLAST analysis using the sequence of BAC AC004590 (Genbank) 

30 which includes MINT3 1 . Putative genes (Gl and G2) were identified by GENSCAN 
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(available at http://ccr-081.mit.edu/GENSCANMIT.html) using the BAC sequence 
data. IMAGE cDNA clone HI 333) was then obtained from the American Type 
Culture Collection and completely sequenced. Potential transcription factor binding 
sites and promoter prediction were examined using the TESS and TSSG programs 
5 respectively, available at the Baylor College of Medicine BCM Launcher 

(http://kiwi.imgen.bcm.tmc.edu: 8088/search launcher/launcher.html/). The nucleotide 
sequence of part of the 5' end of the cDNA of CACNA1G has been submitted to 
Genbank. 

1 0 Bisulfite-PCR Methylation Analysis. 

Bisulfite treatment was performed as reports previously. (Herman, J. G., Graff, 
J. R., Myohanen, S., Nelkin. B. D. and Baylin, S.B. Proc. Natl. Acad. Sci. USA. 
93:9821-9826, 1996). Briefly, 2 g of genomic DNA was denatured with 2 M NaOH 
for 10 min., followed by incubation with 3 M Na-bisulfite, pH 5.0 for 16 hours at 

15 50°C. After treatment DNA was purified using a Wizard Miniprep Column 

(Promega), precipitated with ethanol and resuspended in 20 1 of diluted water. Two 1 
of the aliquot was used as template for each PGR reaction. Semi-quantitative 
bisulfite-PCR was performed essentially as described. Xiong, Z. and Laird, P. W. 
COBRA: a sensitive and quantitative DNA methylation assay. Nucleic Acids Res. 

20 25:2532-2534, 1997. To avoid overestimation of the methylated alleles, the following 
points were considered. First, primers were designed to contain a minimum number 
of CpG dinucleotides in the sequence to avoid the biased amplification of methylated 
alleles. If primers do contain CpG sites, they were designed to amplify methylated 
and unmethylated alleles equally (using a mixture of C or T for sense and a mixture of 

25 G or A for antisense primers). Second, the primers were designed to contain a 

maximum number of thymidines converted from cytosines to avoid amplifying the 
non-converted genomic sequence. Third, restriction sites which only appear after 
bisulfite conversion (e.g. ACGC to ACGT) were used (regions 1-8). PCR was 
performed as described previously (Herman, supra). Primer sequences, annealing 

30 temperature and PCR cycles are available at http://www.jhu.edii/methylation/primers. 
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Twenty % of the PGR products were digested with the appropriate restriction 
enzymes, precipitated with ethanol and separated by 5% polysacrylamide gel 
electrophoresis. Gels were stained with ethidium bromide, and the intensity of each 
allele was calculated by densitometry, using the Image Quant software (Molecular 
5 Dynamics). 

Although the invention has been described with reference to the presently 
preferred embodiment, it should be understood that various modifications can be 
made without departing from the spirit of the invention. Accordingly, the invention is 
10 limited only by the following claims. 
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What is claimed is: 

1 . An isolated nucleic acid molecule comprising the coding region for a T-type 
calcium channel and regulatory sequences associated therewith. 

2. The nucleic acid molecule of claim 1 3 wherein said associated regulatory 
sequences contain CpG-rich regions. 

3. The nucleic acid molecule of claim 2 ? wherein the state of methylation of the 
CpG-rich regions is determinative of the presence of a cellular proliferative 
disorder in a subject from which the nucleic acid molecule is isolated. 

4. The nucleic acid molecule of claim 2, wherein hypermethylation of said CpG 
islands is indicative of the presence of a cellular proliferative disorder in a subject 
from which said nucleic acid is isolated. 

5. The nucleic acid molecule of claim 1, wherein said T-type calcium channel is 
CACNA1G. 

6. The nucleic acid molecule of claim 5, wherein said nucleic acid comprises the 
nucleic acid sequence of SEQ ID NO: 51. 

7. The nucleic acid molecule of claim 6, wherein one or more of regions 1-8 
comprises methylated bases. 

8. A substantially purified polypeptide encoded by the polynucleotide of SEQ ID 
NO:51. 

9. The polypeptide of claim 8, wherein the polypeptide has an amino acid sequence 
as set forth in SEQ ID NO:52. 
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10. A method for detecting a cellular proliferative disorder in a subject comprising: 

a) contacting a nucleic acid-containing specimen from the subject with an agent 
that provides a determination of the methylation state of at least one gene or 
associated regulatory region of the gene; and 

b) identifying aberrant methylation of regions of the gene or regulatory region, 
wherein aberrant methylation is identified as being different when compared to 
the same regions of the gene or associated regulatory region in a subject not 
having said cellular proliferative, thereby detecting a cellular proliferative 
disorder in the subject. 

11. The method of claim 10, wherein the regions of said gene are contained within 
CpG rich regions. 

12. The method of claim 10, wherein the gene is selected from the group consisting of 
APOB, CACNA1G, CDX2, EGFR, FBN1, GPR37, HSPA6, IQGAP2, KL, PAR2, 
PITX2, PTCH, SDC1, SDC4 and combinations thereof. 

13. The method of claim 10, wherein aberrant methylation comprises 
hypermethylation when compared to the same regions of the gene or associated 
regulatory regions in a subject not having the cellular proliferative disorder. 

14. The method of claim 13, wherein the regions comprise regulatory regions of 
CACNA1G. 

15. The method of claim 14, wherein the regions comprise regions 1-8 of CACNA1G. 

16. The method of claim 15, wherein the regions comprise regions 1-2 of CACNA1G. 

17. The method of claim 15, wherein the regions comprise regions 5-7 of CACNA1G. 
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18. The method of claim 15, wherein the regions comprise regions 4 and 8 of 
CACNA1G. 

19. The method of claim 10, wherein the agent is a pair of primers that hybridize with 
a target sequence in the gene or associated regulatory region of the gene. 

20. The method of claim 19, wherein the primers hybridize with a target 
polynucleotide sequence having the sequence selected from the group consisting 
of SEQ ID NO:55-103 and SEQ IDNO:104. 

21. The method of claim 20, wherein the primers are in consecutive pairs selected 
from the group consisting of SEQ ID NO: 1-49 and SEQ ID NO:50. 

22. The method of claim 10, wherein the nucleic acid-containing specimen comprises 
a tissue selected from the group consisting of brain, colon, urogenital, lung, renal, 
prostate, pancreas, liver, esophagus, stomach, hematopoietic, breast, thymus, 
testis, ovarian, and uterine. 

23. The method of claim 10, wherein the nucleic acid-containing specimen is selected 
from the group consisting of serum, urine, saliva, blood, cerebrospinal fluid, 
pleural fluid, ascites fluid, sputum, stool, and biopsy sample. 

24. The method of claim 10, wherein said cellular proliferative disorder is selected 
from the group consisting of low grade astrocytoma, anaplastic astrocytoma, 
glioblastoma, medulloblastoma, gastric cancer, colorectal cancer, colorectal 
adenoma, acute myelogenous leukemia, lung cancer, renal cancer, leukemia, 
breast cancer, prostate cancer, endometrial cancer and neuroblastoma. 
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25. A kit useful for the detection of a cellular proliferative disorder in a subject 
comprising: 

a) carrier means compartmentalized to receive a sample therein; 

b) one or more containers comprising a first container containing a reagent which 
modifies unmethylated cytosine and a second container containing primers for 
amplification of a CpG-containing nucleic acid, wherein the primer hybridizes 
with a target polynucleotide sequence having the sequence selected from the 
group consisting of SEQ ID NO:55-103 and SEQ ID NO: 104. 

26. The kit of claim 25, further comprising a third container containing a methylation 
sensitive restriction endonuclease. 

27. The kit of claim 25, wherein said modifying reagent is bisulfite. 

28. The kit of claim 25, wherein the primers are selected from the group consisting of 
SEQ ID NO: 1-49 and SEQ ID NO:50. 

29. Isolated oligonucleotide primer(s) for detection of a methylated CpG-containing 
nucleic acid wherein the primer hybridizes with a target polynucleotide sequence 
having the sequence selected from the group consisting of SEQ ID NO:55-103 
and SEQ ID NO: 104. 

30. The primers of claim 29, wherein said primers are selected from the group 
consisting of SEQ ID NO:l-49 and SEQ ID NO:50. 

31. An isolated nucleic acid molecule having at least one methylated Cytosine of a 
CpG dinucleotide in a CpG-rich region and encoding a gene selected from the group 
consisting of APOB, CACNA1G, CDX2, EGFR, FBN1, GPR37, HSPA6, IQGAP2, 
KL, PAR2, PITX2, PTCH, SDC1, and SDC4. 
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32. The nucleic acid molecule of claim 31, wherein the methylated C residue of a 
CpG dinucleotide is located within a CpG-rich region selected from the group 
consisting of SEQ ID NO: 105-1 18 and SEQ ID NO: 1 19. 
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(SEQ ID NO: 51) CACNA1G nucleotide sequence 
CCTTTTCGTTCGCCCTCTCGGCX3C 

7UVGCCGGAACAAAGTGAGGGGGAGCCGGCCGGCTGGCCCGGGAAGCCCCAGGGGCGCAGG<^ 

GCGGGGTTTCCCTGCGCCCCGGCGCCCCGCGGGCAGCATGCCCCTGCGGGCAGGGGGAGCTGGGCTGAACTGGCCCT 
GGGGGCTCAGCTTGCGCCCTAGAGCCGACCAGATGTGCCCCCGCCGGGGCCCCCGGGT^ 

GGGGCGCCGCTTGCCCCTCTCCGGATCGCCCGGGGCCCCGGCTGGCCAGAGGATGGACGAGGAGGAGGATGGAGCGGGCG 
CCGAGGAGTCGGGACAGCCCCGGAGCTTCATGCGGCTCAAC 

GCAGAAAAGGACCCGGGCAGCGCGGACTCCGAGGCGGAGGGGCTGCCGTACCCGGCGCTGGCCCCGGTGGTTTTCTTCTA 

CTTGAGCCAGGACAGCCGCCCGCGGAGCTGGTGTCTCCGCACGGTCTGTAACCCCTGGTTTGAGCGCATCAGCATGTTGG 

TCATCCTTCTCAACTGCGTGACCCTGGGCATGTTCCGGCCATGCGAGGACATCGCCTGTGACTCCCAGC 

CTGCAGGCCTTTGATGACTTCATCTTTGCCTTC 

GAAAAAGTGTTACCTGGGAGACACTTGGAACCGGCTTGACTTT^ 

ACCTGCAGAACGTCAGCTTCTCAGCTGTCAGGACAGTCCGTGTGCT^ 

ATGCGCATCCTTGTCACGTTGCTGCTGGATACGCTC 

CATCTTCGGCATCGTCGGCGTCCAGCTGTGGGCAGGGCT^ 

CCCTGAGCGTGGACCTGGAGCGCTATTACCAGACAGAGAACGAGGATGAGAGCCCCTT 

AACGGCATGCGGTCCTGCAGAAGCGTGCC 

GGCCTAGAAmGCTCCAGCAACACCACCrGTGT 

CCTTCAAGGGCGCCATCAACTTTGACAACATT^ 

GTCGACATCATGTACTTTGTGATGGATGCT 

CTTCTTCATGATC^CCTGTGCCTGGTGGTGATTGC 

GGGAGCAGCGTGTGCGGTTCCTGTCCAACGCCAGCACCCTC^ 

CTCAAGTACCTGGTGTACATCCriTCGTAAGGCAGCCCGCAGGCTG 

GCTGCTCAGCAGCCC^GCACCCCTCGGGGGCC^GGAGACCCAGCCCAGCA 

CCGTCCACCACCTGGTGCACCACC^CCACCACCATCACCACCACTACCACCTGGGCAATGG^ 

GCCAGCCCGGAGATCCAGGACAGGGATGCCAATGGGTCCCGCCGGCTCATGCTGCCACCACCCTCGACGCCTGCCCTCTC 
CGGGGCCCCCCCTGGTGGCGCAGAGTCTGTGC^CAGCT^ 

TOCCCCCTCCCAGGTCCCCATCTGAGGCATCCGGCAGGACTGTGGGCAGCGGGAAGGTC 

CCTCCACCGGAGACGCTGAAGGAGAAGGCACTAGTAGAGGTGGCTGCGAGCTCTGGGCCCCCAAC 

CATCCCACCCGGGCCCTACAGCTCCATGCACAAGCTG 

TCTCCAGCCCTTGCTTGAAAGCAGACAGTGGAGCCTGTGGTCCAGACAGCTGCCCCTACTGTGCCCG 
GAGGTGGAGCTCGCCGACCGTGAAATGCCTGACTCAGAC&GC 

CGACCTCCGGGACCCCCACAGCCGGCGGCAACGGAGCCTGGGCCCAGATGCAGAGCCCAGCTCTGTC 

GGCTAATCTGTGACACCTTCCGAAAGATTGTGGACAGC^ 

ACACTCAGCATGGGCATTCAATACCACGAGC^ 

CAGCCTCTTTGCCCTGGAGATGCTGCrrGAAGC 

TCGATGGTGTCATTGTGGTCATCAGCGTGTGGGAGA 

CGCCTGATGCGTGTGCTGAAGCTGGTGCGCTTCCT^ 

CAACGTGGCCACCTTCTGCATGCTGCTTATC 

AGTTTGCCTCTGAGCG<k3ATGGGGACACCCTGCCAGACCGGAAGAATTTO 

TTTC^GATCCTGACCCA.GGAGGACTGGAACAAAGTCCTCTACAATGGTATGGCCTCCAC 

TTTCATTGCCCTC^TGACCTTCGGCAACTAC^ 

AGGGAGATGCCAACAAGTCCGAATCAGAGCCCGATTTCTTCTCACC<^ 

TTGGCCTTGGTGTCCCTGGGAGAGCACCCGGAGCTGCGGAAGAGCCTGCTGCCGCCTCTCATCATCCACACGGCCGCCAC 
ACCCATGTCGCTGCCCAAGAGCACCAGCACGGGCCTGGGCGAGGCGCTGGGCCCTGCGTCGCGCCGCAC 
GGTCGGCAGAGCCTGGGGCGGCCCACGAGATGAAGTCACCGCCCAGCGC 
GCAAG(^GCTGGACCAGCAGGCGCTCCAGCCGGAACAGCC 

AGAGCGGCGGTCCCTGTTGTCGGGAGAAGGCCAGGAGAGCCAGGATGAAGAGK3AGAGCTCAGAAGAGGAGCGGGCCAGCC 
CTGCGGGCAGTGACCATCGCCACAGGGGGTCCCTGGAGCGGGAGGCCAAGAGTTCCTTTGACCTC 
GTGCCAGGGCTGCATCGCACTGCCA.GTGGCCGAGGGTCTGCTTCTIX3 
GCGCCTGGCCCGGGCCCTGCGGCCIX3ATGACCCCCCAC^ 
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(SEQ ID N0:52) 



MDEEEDGAGAEESGQPRSFMRIjNDIjSGAGGRPGPGSAEKDPGSA 
DSEAEGLPYPAI^VVFFYLSQDSRPRSWCLRTVOT 
PCEDIACDSQRCRIIiQAFDDFIFAFFAVE^^VVK^^ 
IAGMLEYSIJDLiQNVSFSAVRTVRVTjRP^ 

FFVFFIFGIVGV(5LWAGLIjRNRCFLPENFSLPLSVDIiERYYQTENEDESPFICSQPRE 
NGMRS CRS VPTLRGDGGGG P P CGIjD yeayns s snttcvnwnq yytncs agehnp fkga 
INFDNIGYAWrAIFQVITLEGVnroiMYFVMDAHSFYOT 
V^TQFSETKQRESQLMREQRVR^L^ 

IiAQVSRAAGWVGKLSSPAPLGGQETQPSSSCSRSHRRLSVHH^^ 
NGTIiRAPRASPEIQDRDANGSRRIiMLPPPSTPAIiSGAPPGGAESVHSFYHADCHIjEPV 
RCQAPPPRSPSEASGRTVGSGKVYPTVHTSPPPETI^CEKAIiVEVAA^ 
PPGPYSSMHKLLETQSTGACQSSCK^SSPCLKADSGATO 

REMPDSDSEAVYBFTQDAQHSDIjRDPHSRRQRSIiGPDAEPSSVIiAFWRXjICDTFRKJV 

DSKYFGRGIMilAILVNTLSMGIEYHEQPEELTN^ 

PFGYIKNPYNIFIXTVIWISVT^IVGOQGGGLSV^RTFRIiMRVIiKL 

VliMKTMDNVATFCMIiLMIiFI F I FS IliGMHIiFGCKFASERDGDTIjPDRKNFDSIiliWAIV 

TVFQIIiTQEDWNKVLYNGMASTSSWAALYF^ 

ANKSESEPDFFSPSLIXSDGDRKKCIiALVSIiGE 

TSTGI^EAMPASRRTSSSGSAEPGAAHEMKSPPSARSSPHSPWSAASSWTSRRSSRN 
SIXSRAPSLKRRSPSGERRSIjLSGEGQESQDEEESSEEERASPAGSDHRHRGSIiEREAK 
S S FDLPDTIiQVPGLHRTASGRGS AS EHQDCNGKS AS GRIiARALRPDD P PIjDGDDADDE 

gnl 

FIG. 3B 
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APOB CpG ISLAND (SEQ ID NO: 105) 

cccgggaggcgcccmggaccttttgcaatcctggcgctcttgcagcctgggcttcctataaatggggtgcgggcgccggccgcgcattc 

ccaccgggacctgcggggctgagtgcccttctcggttgctgccgctgaggagcccgcccagccagccagggccgcgaggccgaggcc 

aggccgcagcccaggagccgccccaccgcagctggcgatggacccgccgaggcccgcgctgctggcgctgcctgcgctgctgctgct 

gctgctggcgggcgccagggccggtgagtgcgcggccgctctgcgggcagcagagggagcgggagggagccggcggaccgaggtt 

ggccggggcagcctgggcctaggccagagggagggcagccacagggtccagggcgagtggggggattggaccagctggcggcccc 

tgcaggctcaggatggggggcgcgggatggaggggctgaggagggggtctccggagcctgcctccctcctgaaaggtgaaacctgtgc 

cggtggtccccctgtcgggcccctagcacccgctgggaagacgtgggaagctc 

CACNA1G CpG ISLAND (SEQ ID NO: 106) 

cctgcggccctacgccaggaccccgcgccgaata^ 

aaagtgggttcgg^ctgggcagcagtctgattggctccagccttcgggagcggacccaggggcaaggg 

gggtmggggtgggaatcggattccagctgtggttctctccctgcgctcccgcccgcactgccacggcggacggccaat^ 

ctcggggccggcggcgtccggcgattggctgcggggctgtctgggggcggggccgaggcttgaagttgaagtgagggatccagctgtg 

gtgtgcgcggggctcctcgccgccgctttcgctcgctcgctccgcgtctcggccggaggaggaggctgtggcgccggcgacagctacg 

gcagcggcagccaccgcggcggctgcggcggcggcatctccgcctccactcccgcccgggactgccccccactgtctccccgcccctc 

ccggacagtgagcccgcggcggggcgggggaaggagccgcccccaccccctccaagcccacccctaaagagatccctcctcccctcc 

cccgccgcctggcgcggagccgggacgatgctgaccccttagatccggctccagctgcgccgcgggaagagggggcgcccctccccg 

gacccccgccctccgccgctgccccccttttcgttcgccctctcggggcggcttcgccgaaggtagcgccgaatccggcaaccggagcct 

gggcgcgaagcgaagaagccggaacaaagtgagggggagccggccggctggcccgggaagccccaggggcgcaggggaagcgg 

gactcgcgccgggcggggmc^ctgcgccccggcgccccgcgggcagcatgcccctgcgggcagggggagctgggctgaactggc 

cctcccgggggctcagcttgcgccctagagcccaccagatgtgcccccgccggggcccc^gggttgcgtgaggacacc 

Ggcgccgcttgcccxtctccggatcgcccggg 

gtcgggacagccccggagcttcatgcggctcaacgacxtgt^ 

gggcagcgcggactccgaggcggaggggctgccgtacx^ggcgctggccccggtggttttcttctactt^ 

cggagctggTgtctccgcacggtctgtaacccatatrc^ 

cgccgggggtcgggggggaagaagacccaccgccaggtgagtcgaagtga 

agggaggggagggggcaccagagtgggagcggagacgcgagcaggtctcgtcggtaacccgggcttaccccacctgcgtacacaca 

cctcagtcttcctgggttgggggggtggggatccaggccaggagaagagagctgtgccccgctggctcgcagctggacgc^ 

gtggtcaggggagggtcgtcatc^tccagatgtgggaagcttcgggagcctgggagctgtactctgcccgcgccggttagcga 

ttggmccgagmggtggggggtgggtgggggcggtggggaggaagctgcggggacggaggaggggggaccgcaatctcctg 

tccctccttcccccgccccaaagtttgcggcggattctagatgttggggggcggggaccaggtcctggcccacctcaccccccacctcgcg 

ggttggaggcacaacaaggagattccggcggcggctgatgtcaggggcgcagaatgagaacaagatgtggtggaggggagctgtctgc 

ccccggagctgggagtggagcccctttccgctagagcccagtgccgcgggtgcctcctacccgatctccattcgatgc 
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CDX2 CpG Island (SEQ ID NO: 107) 
ctcgttaatcacggaagccgccggcctggggctccgcacgc^ 

gggaggaaagaaggaagaaagggagggagggaggaggcaggccagagggagggaccgcctcggaggcagaagagccgcgagg 

agccagcggagcaccgcgggctggggcgcagccacccgccgctcctcgagtcccctcgcccctttcccttcgtgccccccggcagcctc 

cagcgtcggtccccaggcagcatggtgaggtctgctcccggtccctcgccaccatgtacgtgagcta 

EGFR CpG Island (SEQ ID NO: 108) 

gtccgcggggaccgggtccagaggggcagtgctgggaacgcccctcte^ 
gccccactcccgccggagactaggtccxgcgggggcc^ 
cctccctcctcctcgcattctcctcctcctctgct^ 
ggcccgcgcgagctagacgtccgggcagccccxggcgcag^ 

gcgccgaggcggccggagtcccgagctagccccgcggccgccgccgcccagaccggacgacaggccacctcgtcgcgtccgcxcg 

agtccccgcctcgccgccaacgccacaaccaccgcgcacggccccctgactccgtccagtattgatcgggagagccggagcgagctctt 

cggggagcagcgatgcgaccctccgggacggccggggcagcgctcctggcgctgctggctgcgctctgccCggcgagtcgggctc^ 

Gaggaaaagaaaggteagggcgtgtctcgcggctccccgccgcccccggatcgcgccccggaccccgcagcccgcccaaccgcacc 

gcgcaccggcttcgcccgcgccxccgcccgtccmcctgtttccttgagatcacgtgcgccgccgaccgggaccgcggg 

acgmcgttcttcggccgggagagtctggggcgggcggaggaggagacgcgtgggacaccgggctgcaggccaggcggggaacgg 

ccgccgggacctccggcgccccgaaccgctcccaactttctte^ 

agccgggtgctggtgggcgcctggggccggggtcccgcacgggctccccgcgctgtcttcccagggcgcgacggggtcctggcgcgc 

acccgagggccgctgcccacccgccgagactgcctgtttagggaagctgaggaaggaacccaaaaatacagcctccgctcggaccccg 

cgggacaggcggctttctgagaggacctcx^ccgcctccgcgctccgcgcaggtctcaaactgaagccggcgcccgccag 

ggcccctctccaggtccccgcgatcctcgtt^ 

cggctcccggggaggggtggtgctggcggcggttagM 

cccggccgctcgcctcgcccggtgcgccctc^^ 

gccgaaagccccaggctctccttcgatggccgcctcgcggagacgtccgggtctgctccacctgcagcccttcggt 
gcggtggagcgggacgcggctgtccggccactgcagggggggatcgcgggactcttgagcggaagccccg 
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FBNl CpG Island (SEQ ID NO: 109) 

agagccgcgtctggagtgggctctcgacacccagggcaagtgggggcggcagagccctctcctcggtcggcacagcagcctctgccgcggtcccgg 
cctgcgacgcgcccagtcttagcctcccggcctccggcgtctgctgagtgtccggcgggagaggcgcagggagcgcgctaccgggaggcgcgggc 
agcggggactggtmctctcgggccagggcctccggggcaacc^^ 
cacctatcgcgggagaggttaatctcggatctaaacctcgcagccgcagagcg 

aagacaaaaagtcccaggccgggcaggacctgatcacctctgcctcctcccactgcgctaatcctgcgagcgagaggccccgcaccgaggcggagg 

ctgcaaaggggagtggaaagggaggatggatggggccggggggtggggtggtgat 

ttaaaaaaagtatttctctcgcgagaaaccgctgcgcggac^^ 

ggtgccacaagcggacaggagccacagctgGgacagctgcgagcggagccgagcagtggctgtagcggccacgactgggagcagccgccgccg 

cctcctcgggagtcggagccgccgcttctccagtgggtgcagccggggtccgacgggggtcgggcggccaccggggctggagctgcggccacgga 

ggctmgcgtttgcgccgnimgagggcagggacagggactgg 

gacagcttcctgnccggggcttcttggtgccggnccg^ 

ccagctggcggagagggcaggctgaggagtggggcgttcagagcgcgcatcgcgcgcaattcgtgccgctaaaaaaaataaacccagagag 
ccggggcttaggaccgctggggatatgggtactttgcgccgcgctcttctggcggggcccgggaggcgagggattggccggggctgctgcgccgg 
gcctgggctttccagccagctgtggaccaaacggtcttcccttacccaaattaactgcgccacaggcggccgacn 
gcgagcttcagcatcccgatgccctgaaagtctccc^ 

ggagaaggcgggaggagcccggcccgggggacgggcggcgggatagcgggaccccggcggcgcggtgcgcttcagggcgcagcggcggccg 

cagaccgagccccgggcgcggcaagaggcggcgggagccggtggcggctcggcatcatgcgtcgagggcgtctgctggagatcgccctgggattt 

accgtgcttttagcgtcctacacgagccatggggcggacgcc^ 

ggcggtggaggacacgacgcgcttaaagggtaaaggaaccggttccctc 

GPR37(SEQIDNO:110) 

Tcccgccccgcacccgcccctagcccgggrt^ 
ccgcgcaggctccgcgcctccgcccttgftccctrc^ 

gtgaaccaccaggggatcccgtctccccacaaaccagtatctctccgaggaggaggcgaaggagtgggaggaggcaacgagccg 
cgcgggcgcgcgcagcggctggagcgcgggggcgaggccgggccacctccccttcccggccgcgcactgcctggcccgcggcggttccaggca 
ccacccttcccgtccgggctgagc^cgctgtggcagtgactagctcccgcggctagcggcactgtccaccga 
ccccacgatttccttctctgcggcggcacgccgto 

ggggttggaatcccgcttctcctcggcagcagtaggctcgcaagtcgctggggttaggtgggg 

tgtttgcaacgtgmccagcgagctgggagcggggttgtgac^^ 

actggatcaaacgctgtcgaaa 



HSPA6 CpG Island (SEQ ID NO:l 1 1) 

tgtattcgcatggtaacatatcttcggtcttcctgccgctgggctctcagcggccctccaaggcagcccgcaggcccgtgctcgcctcagggatcctcc 

agccccggggagac^ttgcctctaaagttgctgcttt^ 

gtttcmcagcagccctgagtcagaggcgggctggccttgcaagtagcc^^ 

ggtcagcgccgcgctgcgccgcccggctgactcagcccgggcgggcgggcgggaggctctcgactgggcgggaaggtgcgggaaggttcgcgg 

c ggcggggtcggggaggtgcaaaaggatgaaaagcccgtggacggagctgagcagatccggccgggctggcggcagagaaaccgcagggagag 

cctcactgctgagcgcccctcgacgcgggcggcagcagcctccgtggcctccagcatccgacaa 
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IQGAP2 CpG Island (SEQ ID NO:l 12) 

Agagttcactmacttcagtgtcagcgcgcggcggccgtggctggctctggcgagagagcaccgagggagtgggtcgcagatcttcgg 
gcggctaggggaaatcggcgagaggcgggatccgagcgcgccggcggggcgcagagcccgcgagcctggccagcgagggtagcc 
gcggggggcgcgccccgggcgggcccccggagacgcgcaggatgccacacgaagagctgccgtcgctgcagagaccccgctatgg 
ctctattgtggacgatgaaa 

KL CpG Island (SEQ ID NO:113) 

ctcgaaagaggggcgcgggtgggcgcgtctccccgcgagcatctcacctaagggggaatccctttcagcgcacggcgaagtt 
ggctgtcrcacctggcagtccctctaggatttcg 

gagaaaagtgagagcaggtgcttccccagcggcgcgctccgctagggcccggcaggatcccgcccccaagtcgggaaaagttggt^ 
gcgccttttctccccgacgaagccgctccagggctgctctcagaggacgcgcggcaggcaaagagaatgaacctgagcgtc^ 
gtcctgcacggctcccgggagctgggagaaacaggtgcctttctccgacgtccgcgggcgacgcctgccgcaccttgcccgctg 
cccctcccgggcacccctcgccctcggcgccc^^ 

cagcggagcccgccggggagcgggggtgggcgcgccggcggtgggcgGgcgggcgcggcggggcgcgggcataaaggggcgc 

ggcgcggggccccggagcctggctcccgcgcagcatgcccgccagcgccccgccgcgccgcccgcggcxgccgccgccgtcgctgt 

cgctgctgctggtgctgctgggcctgggcggccgccgcctgcgtgcggagccgggcgacggcgcgcagacctgggcccgtttctcgcg 

gcctcctgcccccgaggccgcgggcctcttccagggcaccttccccgacggcttcctctgggccgtgggcagcgccgcctaccagaccg 

agggcggctggcagcagcacggcaagggtgcgtccatctgggacacgttcacccaccaccccctggcacccccgggagactcccgga 

acgccagtctgccgttgggcgccccgtcgccgctgcagcccgccaccggggacgtagccagcgacagctacaacaacgtcttccgcga 

cacggaggcgctgcgcgagctcggggtcactcactaccgcttctccatctcgtgggcgcgagtgctccccaatggca^ 

ccaacxgcgaggggctgcgctactaccggcgcctgctg^ 

ggacctgccccagcgcctgcaggacgcctacggcggctgggccaa^ 

ttccgccacttcggcggtcaggtcaagtactggatcaccatcgacaacccctacgtggtggcctggcacggctacgccaccgggcg^ 
gcccccggcatccggggcagcccgcggctcgggtacctggtggcgcacaa 

PAR2 CpG Island (SEQ ID NO: 1 14) 

Cccggggcgtggcctcccgcaggtgagtacgctgctcctt^ 

ccgattcggggcaggtgagaggctgacmctctcggtgcgtccagtggagctctgagtttcgaatcggcggcggcggattccccgcgc 
ccggcgtcggggcttccaggaggatgcggagccccagcgcggcgtggctgctgggggccgccatcctgc 

FIG. 4D 
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PITX2 CpG Island (SEQ ID NO: 1 15) 

agtccgtgctcctgctcctcggttggctcctaagtgccccgccaggtcccctctccmcgctctcccggctccggctcccgactcttcggcccgctggca 

ccctgcctcgmctcgtcgcccctgctcgctccccccggcgctcgcccgggcgctgtgctcgctcctggatcgccagccgcgcagccgggcte 

cgcgccactgtgcagtggagmggtggaatctctgctgacgtcacgtcart^^ 

gagagagagtgcgagaccgagcgagaaagctggagaggagcagaaagaaactgccagtggcggctagatttcggaggccccagtgcacccgtggactccttcgga 
acttggcaccctcaggagccctgragtcctctcaggcccggcm^ 

cgcggagacagcagctctctcccggtagccgataacggggaaatggagaccaactgccgcaaactggtgtcggcgtgtctgcaattagagaaagataaaagccagca 
ggggaagaatgaggacgtgggcgccgaggacccgtctaagaagaagcggcaaaggcggcagcgga 

PTCA CpG Island (SEQ ID NO: 1 1 6) 

GCGGCCGCAGCGGCAGCAGCGCCCGCCGTGTGAGCAGCAGCAGCGGCTGGTCTGTCAACCGGAGCCC 

GAGCCCGAGCAGCCTGCGGCCAGCAGCGTCCTCGCAAGCCGAGCGCCCAGGCGCGCCAGGAGCCCGC 

AGCAGCGGCAGCAGCGCGCCGGGCCGCCCGGGAAGCCTCCGTCCCCGCGGCGGCGGCGGCGGCGGCG 

GCAACATGGCCTCGGCTGGTAACGCCGCCGAGCCCCAGGACCGCGGCGGCGGCGGCAGCGGCT 

CGGTGCCCCGGGACGGCCGGCTGGAGGCGGGAGGCGCAGACGGACGGGGGGGCTGCGCCGTGCTGCC 

GCGCCGGACCGGGACTATCTGCACCGGCCCAGCTACTGCGACGCCGCCTTCG 

CCANGGNNGGCATTTCAGACIOTOTCNTTCC^ 

CCCXXXSO^CACACAAACAC^^ 

ACNTCTNCAGGAAAAGCAGCAGACAAATGGGGATTGAAAAATTCA 

GGAAAGGGCTGTCTGAGGTCCGCAGGGGGTGGAGGTGTGTGTGTGTGCGTGTGTGTGTGTGNANACAC 

ACGCCCTCCCTGGTGTGCCTTTTCCGGAGCACTGGAAAGCCGTCCACGGCGGACCACCT 

CCGC 



GCGGTCGTAGCGGTAGTAGCGTTCGTCGTGTGAGTAGTAGTAGCGGTTGGTTTGTTAATCGGAGTTCG 
AGTTCGAGTAGTTTGCGGTTAGTAGCGTTCT^ 

AGCGGTAGTAGCGCGTCGGGTCGTTCGGGAAGTTTTCGTTTTCGCGGCGGCGGC 
TATGGTTTCGGTTGGTAACGTCGTCGAGT^^ 

TTCGGGACGGTCGGTTGGAGGCGGGAGGCGTAGACGGACGGGGGGGTTGCGTCGTGTTGTCGCGTCG 

GATCXjGGATTATTTGTATCGGTTTAGTC 

NGGTATTTTAGATIOTOT^ 

TATATAAATATATATATTNTTTTTTT^^ 

AAAAGTAGTAGATAAATGGGGATTGAAAAATT^^ 

TTGAGGTTC^ 

TGTGTTTTTTTCGGAGTATTGGAAAGTC 



PTCHB CpG Island (SEQ ID NO: 1 1 7) 

GCGGCCGCGGCACTGTCCTrGCCCCGTGCCCCCTGCCCT 
TTGCAGCCTAAACTCCTGTACGGCra 

AGGCGGAG AG AGG AAGGGGGG AGGG AGCCG AAATAAAGGTGGTTTCCT1 ' 1 T 1" 1 GGC AG CC A GTTTTG 
GTTTTGTTGAGCATGAAATCTCTGCT^ 

CCAGGTTTTGAGCCGCCTCrCCTTAGGGCCTGGTCGGGGG 
TAAATTCGCGGTGCGAOTCTGCGGAGCT^ 

GAATCCAGGGAAGGCGAGCACCCAGACGGGGGCCCGCCGGGGCCGCGGCCAGCGCCGGGGAAATGC 
CGCGCCGGGGAGCAGCATGCGCCGGCCTGAGCCCTTCCCT 

AGAAAGGAAGGGAGAGGAGGGAAAGATCCATGTGGCTGCCCTCTTCCGATCACAAATATTGTCGTAA 

GTTGCAGCTGGCTGCCCX^Al^nTCCrAATTC 

GGAGTGAACTCCGGCGGCCGC 

FIG. 4E 
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SDC1 CpG Island (SEQ ID NO:l 18) 

Ggagaggtgcgggccgaatccgagccgagcgagaggaatccggcagtagagagcggactccagccggcggaccctgcagccctcg 
cctgggacagcggcgcgctgggcaggcgcccaagagagcatcgagcagcggaacccgcgaagccggcccgcagccgcgacccgc 
gcagcctgccgctctcccgccgccggtccgggcagcatgaggcgcgcggcgctctggctctggctgtgcgcgctggcgctgagcctgc 
agctggccctgccgca 



SDC4 CpG Island (SEQ ID NO: 119) 
Agtaggagccggcgggctcgggcagggcgggtcxcttggggm^ 

actggggaattccgggcggggtgcgggcggcggggcgggggcgggccggggcggggccggtaggccgcctataagatgggtggcg 
cgcccgcccggggccactcgccgcagcctgcgcgccttctccagtccgcggtgccatggcccccgcccgtctgttcg 
cttcgtaggcggagtcgccgagtcggtgggtgcttggaggttcccgggctgggggcgaagcgggggcgcaggccggtgcctc 
cgtcggagcgtgggatgggggggggcagatcgggggtacgctaccccc^^ 

gctccggggtcacgggccagcctccgggatggcttcacgcgccgtgcgcccctcgcctgttgctcttcccgcctccccgggcctcagc 

cgccgcgggctacgggctcgttagtgactaagccggtgte 

cttggagcgctgaatcccctcctcgctctcggggcgcccag 

gtgtactcggaagggggagggttttagggttgtgcgaggccc 

FIG. 4F 
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MUNTT31 (SEQ ID NO : 120) 
CCCGGGGCCT CTATCCTGGC 
AGGGAAGAAG GTGTCAGACG 
GCACGGGGTT CAAGACTCAG 
AAAAAAGGGC ACTTAGGGGA 
TCATTTACAC AACTCTGTGC 
GGAGCGACTC TGCCACTTCC 
CGACCCCAAC CCTCTGCGGC 
AGGGGAGTGA ACAACGCCCA 
TCATCCGACC CATCCCGCCG 
NGGAACTGAG TGACCCACTA 
CTTCCCGGCG TCTAATCCTC 
CTCCGCTCCC GGG 



GGGAAGGGCA GGCCGACCCG 

CGCGGAGCAA CCATAAATAG 

GCGCCGCATA CTCAGAATGA 

TCTGCTCATT AACATGAAAT 

ATGGATTCGG CGAAAGGGCA 

CCCATCCCCT CCCCCCCATC 

GGCGGGAGGT GCGCGGGGGC 

GGGGTGATGG CCTCAGCAAA 

CCTCTCCGGA GTGGCGCAAG 

GGTTCCGCCG TGTCTACCTC 

GCTGTTCCCC CCTCCACCGG 

FIG. 5 



GCAGACTGCG GCCTCTCGGG 
CCCCCCTTTC CCAGAAGACG 
GAGCAGAGAC TCCCGCCAGG 
GCAAATGAGG CCGCCCGGCC 
ACCAGGGAGA CGACGGCGCA 
GGCCGGGGCG GGAACTGAGA 
TGCGTGGGTG GTGCAGCCTT 
GTGAGGGGTG GTGATGGAGG 
CGCCCCAAAA TCTCCGGAGA 
TCGGAGATGT TGGGGAAGTG 
CGCCCAGCAC ACCCGCGGCG 
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SEQUENCE LISTING 



<110> The Johns Hopkins University School of Medicine 

<12 0> CACNA1G POLYNUCLEOTIDE POLYPEPTIDE AND 
METHODS OF USE THEREFOR 



<130> JHU1590WO 

<140> 09/398,522 
<141> 1999-09-15 

<160> 120 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<400> 1 

gtttttgtag tttgggtttt tt 22 

<210> 2 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223 > Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r a G or A 



<210> 3 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 

<222> (0) ... (0) 

<223> y = C or T 

<400> 3 

ttgtggygtt ggygatagtt 20 



<400> 



2 



rccaacaacr ccaacaac 



18 



<210> 4 
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<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 4 

acraaaaaaa aaaaaaaaaa tctctt 26 

<210> 5 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_f eature 
<222> (0) ... (0) 
<223> y = C or T 

<400> 5 

gggggygttt tttttyggat ttt 23 

<210> 6 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) ... (0) 
<223> r = G or A 

<400> 6 

ttcccctacr cccctaaaac ttcc 24 

<210> 7 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulfite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 7 

gtaggttaga gggagggaty gtt 2 3 



<210> 8 
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<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 8 

aaaacaaacc tcaccatact acct 24 

<210> 9 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 9 

tttgatttyg tttagtattg at 22 

<210> 10 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 10 

cccttacctt tcttttcct 19 

<210> 11 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 11 

tttttttatt gygttaattt tg 22 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 



<400> 12 
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tttccccacc tcttcaaata 20 

<210> 13 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 13 

ggttaggtgg ggtaagagtt t 21 

<210> 14 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r - G or A 

<400> 14 

aacrtttaat ccaattacaa acc 23 

<210> 15 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 15 

ttttttagta gttttgagtt agagg 25 

<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 16 

tttcatcctt ttacacctcc c 21 

<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 



<400> 17 

tttattttta ttttagtgtt ag 



22 
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<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> mis cofeature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 18 

ctcttcrtat aacatcctac 20 

<210> 19 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) ... (0) 
<223> y = C or T 

<400> 19 

gtagtatgtt ygttagygtt t 21 

<210> 20 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 20 

aaatacccta aaaaaaaccc 2 0 

<210> 21 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223 > y - C or T 

<400> 21 

yggttttttt gaaatttaat tc 22 

<210> 22 
<211> 19 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc__f eature 
<222> (0) ... (0) 
<223> r = G or A 

<400> 22 

aactccrcat cctcctaaa 19 

<210> 23 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 23 

taagtgttty gttaggtttt tt 22 

<210> 24 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 24 

ccaaactcca ctacacaata ac 22 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> misc_f eature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 25 

9"ttggtttgt taatyggagt 2 0 

<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Bisulf ite-PCR primer 
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<221> misc__f eature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 26 

ttaccaaccr aaaccatatt 2 

<210> 27 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 27 

aatgtgtgga atttaggga 1 

<210> 28 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 28 

taaaacaacc aactacaact tac 2 

<210> 29 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 

<221> mi sc_f eature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 29 

agaggaatty ggtagtagag ag 2 

<210> 30 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 30 

cacaaccaaa accaaaac 1 

<210> 31 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Bisulf ite-PCR primer 
<400> 31 

ggggatttgt ttggtagtgg 2 0 

<210> 32 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bisulf ite-PCR primer 
<400> 32 

cccgaaattc cccaataaa 19 

<210> 33 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 



<221> mi sc_f eature 
<222> (0) ... (0) 
<223> y - C or T 



<400> 33 

gayggygtag tagttatttt gtt 23 

<210> 34 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 



<400> 34 

catcaccacc cctcacttta c 21 

<210> 35 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 



<221> mi sc_f eature 
<222> (0) ... (0) 
<223> y = C or T 



<400> 35 
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ttygggtatt tatagttttt tggag 25 

<210> 36 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<221> misc__f eature 
<222> (0) ... (0) 
<223> r » G or A 

<400> 36 

aatctaccrc cttcactcac tc 22 

<210> 37 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<221> misc_feature 
<222> (0) . . . (0) 
<223> y = C or T 

<40 0> 3 7 

tttaggagyg ttaatgtgag gtt 23 

<210> 38 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<400> 38 

ctaaaaaaac ccaatcttaa aaaaac 2 6 

<210> 39 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<400> 39 

tggataaagg atgtttgggg tttg 24 



<210> 40 
<211> 22 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 



<400> 40 

ccctcccctt acccctaaat cc 22 

<210> 41 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 



<221> misc_f eature 
<222> (0) ... (0) 
<223> y = C or T 



<400> 41 

aatyggattt tagttgtggt tttt 24 

<210> 42 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 



<400> 42 

cacaccacaa ctaaatccct cact 24 

<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 



<221> misc_feature 
<222> (0) ... (0) 
<223> y - C or T 

<400> 43 

ttgtggygtt ggygatagtt 2 0 

<210> 44 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<221> misc_f eature 
<222> (0) ... (0) 
<223> r = G or A 

<400> 44 

acraaaaaaa aaaaaaaaaa tctctt 2 6 

<210> 45 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<221> misc_feature 
<222> (0) ... (0) 
<223> y = C or T 

<400> 45 

gggggygttt tttttyggat ttt 23 

<210> 46 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 46 

ttcccctacr cccctaaaac ttcc 24 

<210> 47 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<400> 47 

gggagtttgg gagttgtatt ttgtt 25 

<210> 48 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 



<400> 48 

aaccaaatta aaaaatcaaa ccctaa 



26 



<210> 49 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<221> misc_feature 
<222> (0) ... (0) 
<223> y = C or T 



<400> 49 

gaggggggat ygtaattttt tg 



22 



<210> 50 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer useful for bisulf ite/PCR analysis of 
CACNA1G 

<221> misc_f eature 
<222> (0) ... (0) 
<223> r = G or A 



<400> 50 

ccraaatctc cttattatac ctccaa 



26 



<210> 51 

<211> 3993 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> CAGNA1G - a gene encoding a T-type calcium channel 



<221> CDS 
<222> (373) . 



. (3993) 



<400> 51 

ccttttcgtt cgccctctcg gggcggcttc gccgaaggta gcgccgaatc cggcaaccgg 60 

agcctgggcg cgaagcgaag aagccggaac aaagtgaggg ggagccggcc ggctggcccg 12 0 

ggaagcccca ggggcgcagg ggaagcggga ctcgcgccgg gcggggtttc cctgcgcccc 180 

ggcgccccgc gggcagcatg cccctgcggg cagggggagc tgggctgaac tggccctccc 240 

gggggctcag cttgcgccct agagcccacc agatgtgccc ccgccggggc ccccgggttg 3 00 

cgtgaggaca cctcctctga ggggcgccgc ttgcccctct ccggatcgcc cggggccccg 3 60 

gctggccaga gg atg gac gag gag gag gat gga gcg ggc gcc gag gag teg 411 
Met Asp Glu Glu Glu Asp Gly Ala Gly Ala Glu Glu Ser 
15 10 
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gga cag ccc egg age ttc atg egg etc aac gac ctg teg ggg gee ggg 459 

Gly Gin Pro Arg Ser Phe Met Arg Leu Asn Asp Leu Ser Gly Ala Gly 

15 20 ~ 25 

ggc egg ccg ggg ccg ggg tea gca gaa aag gac ccg ggc age gcg gac 5 07 

Gly Arg Pro Gly Pro Gly Ser Ala Glu Lys Asp Pro Gly Ser Ala Asp 

30 35 40 45 

tec gag gcg gag ggg ctg ccg tac ccg gcg ctg gec ccg gtg gtt ttc 555 

Ser Glu Ala Glu Gly Leu Pro Tyr Pro Ala Leu Ala Pro Val Val Phe 

50 55 60 

ttc tac ttg age cag gac age cgc ccg egg age tgg tgt etc cgc acg 603 

Phe Tyr Leu Ser Gin Asp Ser Arg Pro Arg Ser Trp Cys Leu Arg Thr 

65 70 75 

gtc tgt aac ccc tgg ttt gag cgc ate age atg ttg gtc ate ctt etc 651 

Val Cys Asn Pro Trp Phe Glu Arg lie Ser Met Leu Val lie Leu Leu 

80 85 90 

aac tgc gtg ace ctg ggc atg ttc egg cca tgc gag gac ate gee tgt 699 

Asn Cys Val Thr Leu Gly Met Phe Arg Pro Cys Glu Asp lie Ala Cys 

95 100 105 

gac tec cag cgc tgc egg ate ctg cag gee ttt gat gac ttc ate ttt 747 

Asp Ser Gin Arg Cys Arg lie Leu Gin Ala Phe Asp Asp Phe lie Phe 

110 115 120 125 

gee ttc ttt gee gtg gag atg gtg gtg aag atg gtg gee ttg ggc ate 795 

Ala Phe Phe Ala Val Glu Met Val Val Lys Met Val Ala Leu Gly lie 

130 135 140 

ttt ggg aaa aag tgt tac ctg gga gac act tgg aac egg ctt gac ttt 843 

Phe Gly Lys Lys Cys Tyr Leu Gly Asp Thr Trp Asn Arg Leu Asp Phe 

145 150 155 

ttc ate gtc ate gca ggg atg ctg gag tac teg ctg gac ctg cag aac 8 91 

Phe lie Val lie Ala Gly Met Leu Glu Tyr Ser Leu Asp Leu Gin Asn 

160 165 170 

gtc age ttc tea get gtc agg aca gtc cgt gtg ctg cga ccg etc agg 93 9 

Val Ser Phe Ser Ala Val Arg Thr Val Arg Val Leu Arg Pro Leu Arg 

175 180 185 

gee att aac egg gtg ccc age atg cgc ate ctt *gtc acg ttg ctg ctg 987 

Ala lie Asn Arg Val Pro Ser Met Arg lie Leu Val Thr Leu Leu Leu 

190 195 200 205 

gat acg ctg ccc atg ctg ggc aac gtc ctg ctg etc tgc ttc ttc gtc 1035 

Asp Thr Leu Pro Met Leu Gly Asn Val Leu Leu Leu Cys Phe Phe Val 

210 215 220 

ttc ttc ate ttc ggc ate gtc ggc gtc cag ctg tgg gca ggg ctg ctt 1083 

Phe Phe lie Phe Gly lie Val Gly Val Gin Leu Trp Ala Gly Leu Leu 

225 230 235 



egg aac cga tgc ttc eta cct gag aat ttc age etc ccc ctg age gtg 1131 
Arg Asn Arg Cys Phe Leu Pro Glu Asn Phe Ser Leu Pro Leu Ser Val 
240 245 250 
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gac ctg gag cgc tat tac cag aca gag aac gag gat gag age ccc ttc 1179 
Asp Leu Glu Arg Tyr Tyr Gin Thr Glu Asn Glu Asp Glu Ser Pro Phe 
255 260 265 

ate tgc tec cag cca cgc gag aac ggc atg egg tec tgc aga age gtg 1227 
lie Cys Ser Gin Pro Arg Glu Asn Gly Met Arg Ser Cys Arg Ser Val 
270 275 280 285 

ccc acg ctg cgc ggg gac ggg ggc ggt ggc cca cct tgc ggt ctg gac 1275 
Pro Thr Leu Arg Gly Asp Gly Gly Gly Gly Pro Pro Cys Gly Leu Asp 
290 295 300 

tat gag gee tac aac age tec age aac ace ace tgt gtc aac tgg aac 1323 
Tyr Glu Ala Tyr Asn Ser Ser Ser Asn Thr Thr Cys Val Asn Trp Asn 
305 310 315 

cag tac tac ace aac tgc tea gcg ggg gag cac aac ccc ttc aag ggc 13 71 

Gin Tyr Tyr Thr Asn Cys Ser Ala Gly Glu His Asn Pro Phe Lys Gly 
320 325 330 

gee ate aac ttt gac aac att ggc tat gec tgg ate gee ate ttc cag 1419 
Ala He Asn Phe Asp Asn He Gly Tyr Ala Trp He Ala He Phe Gin 
335 340 345 

gtc ate acg ctg gag ggc tgg gtc gac ate atg tac ttt gtg atg gat 14 67 

Val He Thr Leu Glu Gly Trp Val Asp He Met Tyr Phe Val Met Asp 
350 355 360 365 

get cat tec ttc tac aat ttc ate tac ttc ate etc etc ate ate gtg 1515 
Ala His Ser Phe Tyr Asn Phe lie Tyr Phe He Leu Leu He lie Val 
370 375 380 

ggc tec ttc ttc atg ate aac ctg tgc ctg gtg gtg att gee acg cag 1563 
Gly Ser Phe Phe Met He Asn Leu Cys Leu Val Val He Ala Thr Gin 
385 390 395 

ttc tea gag acc aag cag egg gaa age cag ctg atg egg gag cag cgt 1611 
Phe Ser Glu Thr Lys Gin Arg Glu Ser Gin Leu Met Arg Glu Gin Arg 
400 405 410 

gtg egg ttc ctg tec aac gee age acc ctg get age ttc tct gag ccc 1659 
Val Arg Phe Leu Ser Asn Ala Ser Thr Leu Ala Ser Phe Ser Glu Pro 
415 420 425 

ggc age tgc tat gag gag ctg etc aag tac ctg gtg tac ate ctt cgt 1707 
Gly Ser Cys Tyr Glu Glu Leu Leu Lys Tyr Leu Val Tyr He Leu Arg 
430 435 440 445 

aag gca gee cgc agg ctg get cag gtc tct egg gca gca ggt gtg egg 1755 
Lys Ala Ala Arg Arg Leu Ala Gin Val Ser Arg Ala Ala Gly Val Arg 
450 455 460 

gtt ggg ctg etc age age cca gca ccc etc ggg ggc cag gag acc cag 1803 
Val Gly Leu Leu Ser Ser Pro Ala Pro Leu Gly Gly Gin Glu Thr Gin 
465 470 475 

ccc age age age tgc tct cgc tec cac cgc cgc eta tec gtc cac cac 1851 
Pro Ser Ser Ser Cys Ser Arg Ser His Arg Arg Leu Ser Val His His 
480 485 490 
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ctg gtg cac cac cac cac cac cat cac cac cac tac cac ctg ggc aat 18 99 

Leu Val His His His His His His His His His Tyr His Leu Gly Asn 
495 500 505 

ggg acg etc agg gec ccc egg gec age ccg gag ate cag gac agg gat 1947 
Gly Thr Leu Arg Ala Pro Arg Ala Ser Pro Glu He Gin Asp Arg Asp 
510 515 520 525 

gec aat ggg tec cgc egg etc atg ctg cca cca ccc teg acg cct gee 1995 
Ala Asn Gly Ser Arg Arg Leu Met Leu Pro Pro Pro Ser Thr Pro Ala 
530 535 540 

etc tec ggg gee ccc cct ggt ggc gca gag tct gtg cac age ttc tac 2043 
Leu Ser Gly Ala Pro Pro Gly Gly Ala Glu Ser Val His Ser Phe Tyr 
545 550 555 

cat gee gac tgc cac tta gag cca gtc cgc tgc cag gcg ccc cct ccc 2091 
His Ala Asp Cys His Leu Glu Pro Val Arg Cys Gin Ala Pro Pro Pro 
560 565 570 

agg tec cca tct gag gca tec ggc agg act gtg ggc age ggg aag gtg 213 9 

Arg Ser Pro Ser Glu Ala Ser Gly Arg Thr Val Gly Ser Gly Lys Val 
575 580 585 

tat ccc ace gtg cac ace age cct cca ccg gag acg ctg aag gag aag 2187 
Tyr Pro Thr Val His Thr Ser Pro Pro Pro Glu Thr Leu Lys Glu Lys 
590 595 600 605 

gca eta gta gag gtg get gee age tct ggg ccc cca ace etc ace age 2235 
Ala Leu Val Glu Val Ala Ala Ser Ser Gly Pro Pro Thr Leu Thr Ser 
610 615 620 

etc aac ate cca ccc ggg ccc tac age tec atg cac aag ctg ctg gag 2283 
Leu Asn He Pro Pro Gly Pro Tyr Ser Ser Met His Lys Leu Leu Glu 
625 630 635 

aca cag agt aca ggt gee tgc caa age tct tgc aag ate tec age cct 2331 
Thr Gin Ser Thr Gly Ala Cys Gin Ser Ser Cys Lys He Ser Ser Pro 
640 645 650 

tgc ttg aaa gca gac agt gga gee tgt ggt cca gac age tgc ccc tac 23 79 

Cys Leu Lys Ala Asp Ser Gly Ala Cys Gly Pro Asp Ser Cys Pro Tyr 
655 660 665 

tgt gee egg gec ggg gca ggg gag gtg gag etc gee gac cgt gaa atg 2427 
Cys Ala Arg Ala Gly Ala Gly Glu Val Glu Leu Ala Asp Arg Glu Met 
670 675 680 685 

cct gac tea gac age gag gca gtt tat gag ttc aca cag gat gec cag 2475 
Pro Asp Ser Asp Ser Glu Ala Val Tyr Glu Phe Thr Gin Asp Ala Gin 
690 695 700 

cac age gac etc egg gac ccc cac age egg egg caa egg age ctg ggc 2 523 

His Ser Asp Leu Arg Asp Pro His Ser Arg Arg Gin Arg Ser Leu Gly 
705 710 715 

cca gat gca gag ccc age tct gtg ctg gee ttc tgg agg eta ate tgt 2571 
Pro Asp Ala Glu Pro Ser Ser Val Leu Ala Phe Trp Arg Leu He Cys 
720 725 730 
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gac acc ttc cga aag att gtg gac age aag tac ttt ggc egg gga ate 2619 

Asp Thr Phe Arg Lys lie Val Asp Ser Lys Tyr Phe Gly Arg Gly He 

735 740 745 

atg ate gee ate ctg gtc aac aca etc age atg ggc ate gaa tac cac 2667 

Met He Ala He Leu Val Asn Thr Leu Ser Met Gly He Glu Tyr His 
750 755 760 765 

gag cag ccc gag gag ctt acc aac gec eta gaa ate age aac ate gtc 2715 

Glu Gin Pro Glu Glu Leu Thr Asn Ala Leu Glu He Ser Asn He Val 
770 775 780 

ttc acc age etc ttt gee ctg gag atg ctg ctg aag ctg ctt gtg tat 2763 

Phe Thr Ser Leu Phe Ala Leu Glu Met Leu Leu Lys Leu Leu Val Tyr 

785 790 795 

ggt ccc ttt ggc tac ate aag aat ccc tac aac ate ttc gat ggt gtc 2 811 

Gly Pro Phe Gly Tyr He Lys Asn Pro Tyr Asn He Phe Asp Gly Val 

800 805 810 

att gtg gtc ate age gtg tgg gag ate gtg ggc cag cag ggg ggc ggc 2 859 

He Val Val He Ser Val Trp Glu He Val Gly Gin Gin Gly Gly Gly 

815 820 825 

ctg teg gtg ctg egg acc ttc cgc ctg atg cgt gtg ctg aag ctg gtg 2907 

Leu Ser Val Leu Arg Thr Phe Arg Leu Met Arg Val Leu Lys Leu Val 
830 835 840 845 

cgc ttc ctg ccg gcg ctg cag egg cag ctg gtg gtg etc atg aag acc 2 955 

Arg Phe Leu Pro Ala Leu Gin Arg Gin Leu Val Val Leu Met Lys Thr 
850 855 860 

atg gac aac gtg gee acc ttc tgc atg ctg ctt atg etc ttc ate ttc 3003 

Met Asp Asn Val Ala Thr Phe Cys Met Leu Leu Met Leu Phe He Phe 

865 870 875 

ate ttc age ate ctg ggc atg cat etc ttc ggc tgc aag ttt gee tct 3 051 

He Phe Ser He Leu Gly Met His Leu Phe Gly Cys Lys Phe Ala Ser 

880 885 890 

gag egg gat ggg gac acc ctg cca gac egg aag aat ttt gac tec ttg 3 0 99 

Glu Arg Asp Gly Asp Thr Leu Pro Asp Arg Lys Asn Phe Asp Ser Leu 

895 900 905 

etc tgg gec ate gtc act gtc ttt cag ate ctg acc cag gag gac tgg 3147 

Leu Trp Ala He Val Thr Val Phe Gin He Leu Thr Gin Glu Asp Trp 
910 915 920 925 

aac aaa gtc etc tac aat ggt atg gec tec acg teg tec tgg gcg gec 3195 

Asn Lys Val Leu Tyr Asn Gly Met Ala Ser Thr Ser Ser Trp Ala Ala 
930 935 940 

ctt tat ttc att gee etc atg acc ttc ggc aac tac gtg etc ttc aat 3243 

Leu Tyr Phe He Ala Leu Met Thr Phe Gly Asn Tyr Val Leu Phe Asn 

945 950 955 

ttg ctg gtc gee att ctg gtg gag ggc ttc cag gcg gag gga gat gee 3291 

Leu Leu Val Ala He Leu Val Glu Gly Phe Gin Ala Glu Gly Asp Ala 

960 965 970 
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aac aag tec gaa tea gag ccc gat ttc 

Asn Lys Ser Glu Ser Glu Pro Asp Phe 
975 980 

gat ggg gac agg aag aag tgc ttg gec 

Asp Gly Asp Arg Lys Lys Cys Leu Ala 
990 995 
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ttc tea ccc age ctg gat ggt 3339 
Phe Ser Pro Ser Leu Asp Gly 
985 

ttg gtg tec ctg gga gag cac 3387 
Leu Val Ser Leu Gly Glu His 
1000 1005 



ccg gag ctg egg aag age ctg ctg ccg cct etc ate ate cac acg gec 3435 
Pro Glu Leu Arg Lys Ser Leu Leu Pro Pro Leu lie lie His Thr Ala 
1010 1015 1020 

gec aca ccc atg teg ctg ccc aag age acc age acg ggc ctg ggc gag 3483 
Ala Thr Pro Met Ser Leu Pro Lys Ser Thr Ser Thr Gly Leu Gly Glu 
1025 1030 1035 

gcg ctg ggc cct gcg teg cgc cgc acc age age age ggg teg gca gag 3531 
Ala Leu Gly Pro Ala Ser Arg Arg Thr Ser Ser Ser Gly Ser Ala Glu 
1040 1045 1050 

cct ggg gcg gee cac gag atg aag tea ccg ccc age gec cgc age tct 3579 
Pro Gly Ala Ala His Glu Met Lys Ser Pro Pro Ser Ala Arg Ser Ser 
1055 1060 1065 

ccg cac age ccc tgg age get gca age age tgg acc age agg cgc tec 3 62 7 

Pro His Ser Pro Trp Ser Ala Ala Ser Ser Trp Thr Ser Arg Arg Ser 
1070 1075 1080 1085 

age egg aac age etc ggc cgt gca ccc age ctg aag egg aga age cca 3 675 

Ser Arg Asn Ser Leu Gly Arg Ala Pro Ser Leu Lys Arg Arg Ser Pro 
1090 1095 1100 

a 9t gga gag egg egg tec ctg ttg teg gga gaa ggc cag gag age cag 3 723 

Ser Gly Glu Arg Arg Ser Leu Leu Ser Gly Glu Gly Gin Glu Ser Gin 
1105 1110 1115 

gat gaa gag gag age tea gaa gag gag egg gee age cct gcg ggc agt 3 771 

Asp Glu Glu Glu Ser Ser Glu Glu Glu Arg Ala Ser Pro Ala Gly Ser 
1120 1125 1130 

gac cat cgc cac agg ggg tec ctg gag egg gag gec aag agt tec ttt 3 819 

Asp His Arg His Arg Gly Ser Leu Glu Arg Glu Ala Lys Ser Ser Phe 
1135 1140 1145 

gac ctg cca gac aca ctg cag gtg cca ggg ctg cat cgc act gee agt 3 867 

Asp Leu Pro Asp Thr Leu Gin Val Pro Gly Leu His Arg Thr Ala Ser 
1150 1155 1160 1165 

ggc cga ggg tct get tct gag cac cag gac tgc aat ggc aag teg get 3 915 

Gly Arg Gly Ser Ala Ser Glu His Gin Asp Cys Asn Gly Lys Ser Ala 
1170 1175 1180 

tea ggg cgc ctg gee egg gec ctg egg cct gat gac ccc cca ctg gat 3 963 

Ser Gly Arg Leu Ala Arg Ala Leu Arg Pro Asp Asp Pro Pro Leu Asp 
1185 1190 1195 



999 9 at 9 ac 9" cc 9 at gac gag ggc aac ctg 
Gly Asp Asp Ala Asp Asp Glu Gly Asn Leu 
1200 1205 
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<220> 

<223> CACNA1G - a gene ecoding a T-type calcium channel 



<400> 52 



Met 


Asp 


Glu 


Glu 


Glu 


Asp 


Gly 


Ala 


Gly 


Ala 


Glu 


Glu 


Ser 


Gly 


Gin 


Pro 


1 








5 










10 










15 




Arg 


Ser 


Phe 


Met 


Arg 


Leu 


Asn 


Asp 


Leu 


Ser 


Gly 


Ala 


Gly 


Gly Arg 


Pro 








20 










25 










3 0 






Glv 


Pro 


Glv 


Ser 


Ala 


Glu 


Lvs 


Asp 


Pro 


Gly 


Ser 


Ala 


Asp 


Ser 


Glu 


Ala 






35 










40 










45 








Glu 


Glv 


Leu 


Pro 


Tyr 


Pro 


Ala 


Leu 


Ala 


Pro 


Val 


Val 


Phe 


Phe 




Leu 




50 










55 










60 










Seir 


Gin 


Asp 


Ser 


Arg 


Pro 


Arg 


Ser 


Trp 


Cvs 


Leu 


Arg 


Thr 


Val 


Cys 


Asn 


65 










70 










75 










80 


Pro 


TrD 


Phe 


Glu 


Arg 


He 


Ser 


Met 


Leu 


Val 


He 


Leu 


Leu 


Asn 


Cys 


Val 










85 










90 










95 




Thr 


Leu 


Glv 


Met 


Phe 


Arg 


Pro 


Cvs 


Glu 


Asp 


He 


Ala 


Cys 


Asp 


Ser 


Gin 








10 0 










105 










110 






A TO 


Cys 


Arg 


He 


Leu 


Gin 


Ala 


Phe 


Asp 


Asp 


Phe 


He 


Phe 


Ala 


Phe 


Phe 






115 










120 










125 








Ala 


Val 


Glu 


Met 


Val 


Val 


Lys 


Met 


Val 


Ala 


Leu 


Gly 


He 


Phe 


Glv 


Lys 




13 0 










135 










140 










Lys 


Cys 


Tvr 


Leu 


Gly 


Asp 


Thr 


Trp 


Asn 


Arg 


Leu 


Asp 


Phe 


Phe 


He 


Val 


145 










150 










155 










160 


lie 


Ala 


Gly 


Met 


Leu 


Glu 


Tyr 


Ser 


Leu 


Asp 


Leu 


Gin 


Asn 


Val 


Ser 


Phe 










165 










170 










175 




Ser 


Ala 


Val 


Arg 


Thr 


Val 


Arg 


Val 


Leu 


Arg 


Pro 


Leu 


Arg 


Ala 


He 


Asn 








180 










185 










190 






Arg 


Val 


Pro 


Ser 


Met 


Arcr 


He 


Leu 


Val 


Thr 


Leu 


Leu 


Leu 


Asp 


Thr 


Leu 






195 










200 










205 








Pro 


Met 


Leu 


Gly 


Asn 


Val 


Leu 


Leu 


Leu 


Cys 


Phe 


Phe 


Val 


Phe 


Phe 


He 




210 










215 










220 










Phe 


Gly 


He 


Val 


Gly 


Val 


Gin 


Leu 


Trp 


Ala 


Gly 


Leu 


Leu 


Arg 


Asn 


Arg 


225 










230 










235 










240 


Cys 


Phe 


Leu 


Pro 


Glu 


Asn 


Phe 


Ser 


Leu 


Pro 


Leu 


Ser 


Val 


Asp 


Leu 


Glu 










245 










250 










255 




Arg 


Tyr 


Tyr 


Gin 


Thr 


Glu 


Asn 


Glu 


Asp 


Glu 


Ser 


Pro 


Phe 


He 


Cys 


Ser 








260 










265 










270 






Gin 


Pro 


Arg 


Glu 


Asn 


Gly 


Met 


Arg 


Ser 


Cys 


Arg 


Ser 


Val 


Pro 


Thr 


Leu 






275 










280 










285 








Arg 


Gly 


Asp 


Gly 


Gly 


Gly 


Gly 


Pro 


Pro 


Cys 


Gly 


Leu 


Asp 


Tyr 


Glu 


Ala 




290 










295 










300 










Tyr 


Asn 


Ser 


Ser 


Ser 


Asn 


Thr 


Thr 


Cys 


Val 


Asn 


Trp 


Asn 


Gin 


Tyr 


Tyr 


305 










310 










315 










320 


Thr 


Asn 


Cys 


Ser 


Ala 


Gly 


Glu 


His 


Asn 


Pro 


Phe 


Lys 


Gly 


Ala 


He 


Asn 










325 










330 










335 




Phe 


Asp 


Asn 


He 


Gly 


Tyr 


Ala 


Trp 


He 


Ala 


He 


Phe 


Gin 


Val 


He 


Thr 








340 










345 










350 






Leu 


Glu 


Gly 


Trp 


Val 


Asp 


He 


Met 


Tyr 


Phe 


Val 


Met 


Asp 


Ala 


His 


Ser 






355 










360 










365 








Phe 


Tyr 


Asn 


Phe 


He 


Tyr 


Phe 


He 


Leu 


Leu 


He 


He 


Val 


Gly 


Ser 


Phe 




370 










375 










380 










Phe 


Met 


He 


Asn 


Leu 


Cys 


Leu 


Val 


Val 


He 


Ala 


Thr 


Gin 


Phe 


Ser 


Glu 



385 390 395 400 
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Thir 


Lys 


Gin 


Arg 


Glu 


Ser 


Gin 


Leu 










405 








Leu 


Ser 


Asii 


Ala 


Ser 


Thr 


Leu 


Ala 








42 0 










Tvx 

J; 


Glu 


Glu 


Leu 


Leu 


Lys 


Tvr 


Leu 






435 










440 


t\±- y 


Arg 


Leu 


Ala 


Gin 


Val 


Ser 


Arg 




45 0 










455 




Leu. 


Ser 


Ser 


Pro 


Ala 


Pro 


Leu 


Gly 


4 R 










4 70 






Ser 


Cys 


Ser 


Atct 
rij - y 


Ser 


His 


Arg 


Ara 










4 8 5 








His 


His 


His 


His 


His 


His 


His 


His 








5 00 










Arg 


Ala 


Pro 




Ala 


Ser 


Pro 


Glu 






—) j_ o 










520 


Ser 


Ai~cr 
^.j- y 


Arg 


Leu 


Met 


Leu 


Pro 


Pro 




53 0 










535 




Ala 


Pro 


Pro 


Gly 


Gly 


Ala 


Glu 


Ser 


545 










55 0 






Cys 


His 


Leu 


Glu 


Pro 


Val 


Arg 


Cys 










5 65 








Ser 


Glu 


Ala 


Ser 


Gly 


Arg 


Thr 


Val 








^ o n 










Val 


His 


Thr 


Ser 


Pro 


Pro 


Pro 


Glu 
















600 


Glu 


Val 


Ala 


Ala 


Ser 


Ser 


Glv 


Pro 




D -L L/ 










615 




XT J-<~> 


XT J_ 


Gly 


Pro 


Tvr 


Ser 


Ser 


Met 


D Z O 
















Thr 


Gly 


Ala 


Cys 


Gin 


Ser 


Ser 


Cys 










645 








Ala 


Asp 


Ser 


Glv 


Ala 


Cys 


Glv 


Pro 








660 










Jr\-i- d 




Ala 


Gly 


Glu 


Val 


Glu 


Leu 






675 










680 


Asp 


Ser 


Glu 


Ala 


Val 


Tvr 


Glu 


Phe 




6 90 










695 




LeU 


7\ -v~r~r 
riX y 




It A. \J 


Hi a 
rxa. o 


Ser 


Arg 


Arg 












710 






Glu 


Pro 


Ser 


Ser 


Val 


Leu 


Ala 


Phe 










725 








Arg 


Lys 


lie 


Val 


Asp 


Ser 


Lys 


Tvr 








74 0 










lie 


Leu 


Val 


Asii 


Thr 


Leu 


Ser 


Met 






75 5 










76 0 


Glu 


Glu 


Leu 


Thr 


Asn 


Ala 


Leu 


Glu 














775 




Leu 


Phe 


Ala 


Leu 


Glu 


Met 


Leu 


Leu 


/ O 3 










790 






Gly 


i i 1 


lie 


J - J j ° 


Asn 


Pro 


Tvr 


Asn 










8 05 








T"| ~ 

A. J. C 




Val 


Trp 


Glu 


lie 


Val 


Glv 








82 0 










Leu 


Arg 


Thr 


Phe 


Arg 


Leu 


Met 


Arg 






835 










840 


Pro 


Ala 


Leu 


Gin 


Arg 


Gin 


Leu 


Val 




850 










855 




Val 


Ala 


Thr 


Phe 


Cys 


Met 


Leu 


Leu 


865 










870 







19 



Met 


Ara 


Glu 


Gin 


Arg 


Val 


Arg 


Phe 




410 










415 




Ser 


Phe 


Ser 


Glu 


Pro 


Gly 


Ser 


Cvs 

jr 


425 










430 






Val 


Tvr 

J: 


lie 


Leu 


Arg 


Lys 


Ala 


Ala 










445 








Ala 


Ala 


Gly 


Val 


Arg 


Val 


Gly 


Leu 








460 










Glv 


Gin 


Glu 


Thr 


Gin 


Pro 


Ser 


Ser 






475 










480 


Leu 


Ser 


Val 


His 


His 


Leu 


Val 


His 




490 










495 




Tvr 


His 


Leu 


Gly 


Asn 


Glv 
- 1 - Jr 


Thr 


Leu 


505 










510 






lie 


Gin 


Asp 


Arg 


Asp 


Ala 


Asn 


Glv 

v - f ->- J 










525 








Pro 


Ser 


Thr 


Pro 


Ala 


Leu 


Ser 


Gly 








540 










Val 


His 


Ser 


Phe 


Tvr 


His 


Ala 


Asp 






555 










560 


Gin 


Ala 


Pro 


Pro 


Pro 


Arg 


Ser 


Pro 




570 










575 




Glv 


Ser 


Glv 


Lys 


Val 


Tvr 

JL 


Pro 


Thr 


585 










590 






Thr 


Leu 


Lvs 


Glu 


Lvs 

Jr 


Ala 


Leu 


Val 










605 








Pro 


Thr 


Leu 


Thr 


Ser 


Leu 


Asn 


He 








620 










His 


Lys 


Leu 


Leu 


Glu 


Thr 


Gin 


Ser 






635 










640 


Lys 


lie 


Ser 


Ser 


Pro 


Cvs 


Leu 


Lvs 

Jr 




650 










655 




Asp 


Ser 


Cvs 


Pro 


Tvr 


Cvs 


Ala 


Arg 


665 










670 






Ala 


Asp 


Arg 


Glu 


Met 


Pro 


Asp 


Ser 










685 








Thr 


Gin 


Asp 


Ala 


Gin 


His 


Ser 


Asp 








700 










Gin 


Arg 


Ser 


Leu 


Glv 


Pro 


Asp 


Ala 






715 










720 


Trn 


Arg 


Leu 


lie 


Cys 


Asp 


Thr 


Phe 




73 0 










735 




Phe 


Gly 


Arg 


Gly 


lie 


Met 


He 


Ala 


745 










750 






Glv 


lie 


Glu 


Tvr 

**• Jr 


His 


Glu 


Gin 


Pro 










765 








lie 


Ser 


Asn 


lie 


Val 


Phe 


Thr 


Ser 








780 










Lys 


Leu 


Leu 


Val 


Tvr 

Jr 


Glv 


Pro 


Phe 






795 










800 


lie 


Phe 


Asp 


Gly 


Val 


lie 


Val 


Val 




810 










815 




Gin 


Gin 


Glv 


Glv 


Glv 


Leu 


Ser 


Val 


fi 9 ^ 










83 0 






Val 


Leu 


Lys 


Leu 


Val 


Arg 


Phe 


Leu 










845 








Val 


Leu 


Met 


Lys 


Thr 


Met 


Asp 


Asn 








860 










Met 


Leu 


Phe 


lie 


Phe 


He 


Phe 


Ser 






875 










880 
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He 


Leu 


Gly 


Met 


His 


Leu 


Phe 


Gly 


Cys 


Lys 


Phe 


Ala 


Ser 


Glu 


Arg 


Asp 










885 










890 










895 




Gly 


Asp 


Thr 


Leu 


Pro 


Asp 


Arg 


LVS 


Asn 


Phe 


Asp 


Ser 


Leu 


Leu 


Trp 


Ala 








900 










905 










910 






He 


Val 


Thr 


Val 


Phe 


Gin 


He 


Leu 


Thr 


Gin 


Glu 


Asp 


Trp 


Asn 


Lvs 


Val 






915 










920 










925 








Leu 


Tyr 


Asn 


Gly 


Met 


Ala 


Ser 


Thr 


Ser 


Ser 


Trp 


Ala 


Ala 


Leu 


Tyr 


Phe 




930 










935 










940 










He 


Ala 


Leu 


Met 


Thr 


Phe 


Gly 


Asn 


Tyr 


Val 


Leu 


Phe 


Asn 


Leu 


Leu 


Val 


945 










950 










955 










960 


Ala 


He 


Leu 


Val 


Glu 


Glv 


Phe 


Gin 


Ala 


Glu 


Glv 


Asp 


Ala 


Asn 


Lys 


Ser 










965 










970 










975 




Glu 


Ser 


Glu 


Pro 


Asp 


Phe 


Phe 


Ser 


Pro 


Ser 


Leu 


Asp 


Gly 


Asp 


Gly 


Asp 








980 










985 










990 






Arg 


Lys 


Lys 


Cys 


Leu 


Ala 


Leu 


Val 


Ser 


Leu 


Gly 


Glu 


His 


Pro 


Glu 


Leu 






995 










1000 








1005 






Arg 


Lys 


Ser 


Leu 


Leu 


Pro 


Pro 


Leu 


He 


He 


His 


Thr 


Ala 


Ala 


Thr 


Pro 




1010 








1015 








1020 








Met 


Ser 


Leu 


Pro 


Lys 


Ser 


Thr 


Ser 


Thr 


Gly 


Leu 


Gly Glu Ala 


Leu 


Gly 


1025 








1030 








1035 








1041 


Pro 


Ala 


Ser 


Arg 


Arg 


Thr 


Ser 


Ser 


Ser 


Gly 


Ser 


Ala 


Glu 


Pro 


Gly 


Ala 










1045 








1050 








1055 


Ala 


His 


Glu 


Met 


Lys 


Ser 


Pro 


Pro 


Ser 


Ala Arg 


Ser 


Ser 


Pro 


His 


Ser 








1060 








1065 








1070 




Pro 


Trp 


Ser 


Ala 


Ala 


Ser 


Ser 


Trp 


Thr 


Ser Arg Arg 


Ser 


Ser 


Arg 


Asn 






1075 








1080 








1085 






Ser 


Leu 


Gly Arg Ala 


Pro 


Ser 


Leu 


Lys 


Arg Arg 


Ser 


Pro 


Ser 


Gly 


Glu 




1090 








1095 








1100 








Arg 


Arg 


Ser 


Leu 


Leu 


Ser 


Gly Glu Gly Gin Glu 


Ser 


Gin 


Asp 


Glu 


Glu 


1105 








1110 








1115 








1121 


Glu 


Ser 


Ser 


Glu 


Glu 


Glu 


Arg 


Ala 


Ser 


Pro 


Ala 


Gly 


Ser 


Asp 


His 


Arg 










1125 








1130 








1135 


His 


Arg 


Gly 


Ser 


Leu 


Glu 


Arg 


Glu 


Ala 


Lys 


Ser 


Ser 


Phe 


Asp 


Leu 


Pro 








1140 








1145 








1150 




Asp 


Thr 


Leu 


Gin 


Val 


Pro 


Gly 


Leu 


His 


Arg 


Thr 


Ala 


Ser 


Gly Arg 


Gly 






1155 








1160 








1165 






Ser 


Ala 


Ser 


Glu 


His 


Gin 


Asp 


Cys 


Asn 


Gly Lys 


Ser 


Ala 


Ser 


Gly Arg 




1170 








1175 








1180 








Leu Ala Arg Ala 


Leu 


Arg 


Pro 


Asp Asp 


Pro 


Pro 


Leu Asp 


Gly Asp 


Asp 



1185 1190 1195 1200 

Ala Asp Asp Glu Gly Asn Leu 
1205 

<210> 53 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer for PCR (GAPDH) 
<400> 53 

cggagtcaac ggattggtcg tat 23 

<210> 54 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 



WO 01/019845 



PCT/US00/25479 



21 



<223> primer for PCR (GAPDH) 



<400> 54 

agccttctcc atggtggtga agac 



24 



<210> 55 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 55 

aaaaaaccca aactacaaaa ac 22 

<210> 56 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r = G or A 



<210> 57 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) ... (0) 
<223> y = C or T 

<400> 57 

aactatcycc aacyccacaa 20 

<210> 58 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc__f eature 
<222> (0) . . . (0) 
<223> r - G or A 



<400> 56 

gttgttggrg ttgttggr 



18 



<400> 58 

aagagatttt tttttttttt tttrgt 



26 



WO 01/019845 



PCT/US00/25479 



22 



<210> 59 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) ... (0) 
<223> y = . C or T 

<400> 59 

aaaatccyaa aaaaaacycc ccc 23 

<210> 60 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc__f eature 
<222> (0) . . . (0) 
<223> r - G or A 



<210> 61 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite- PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> y - C or T 

<400> 61 

aacyatccct ccctctaacc tac 23 

<210> 62 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 



<400> 60 

ggaagtttta ggggrgtagg ggaa 



24 



<400> 62 

aggtagtatg gtgaggtttg tttt 



24 



<210> 63 
<2H> 22 
<212> DNA 



WO 01/019845 PCT/US00/25479 

23 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 63 

atcaatacta aacraaatca aa 22 

<210> 64 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 64 

aggaaaagaa aggtaaggg 19 

<210> 65 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) ... (0) 
<223> r - G or A 

<400> 65 

caaaattaac rcaataaaaa aa 22 

<210> 66 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 66 

tatttgaaga ggtggggaaa 2 0 

<210> 67 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 



<400> 67 

aaactcttac cccacctaac c 



21 



WO 01/019845 
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24 

<210> 68 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 68 

ggtttgtaat tggattaaay gtt 2 3 

<210> 69 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 69 

ccactaactc aaaactaaaa aa 22 

<210> 70 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 70 

gggaggtgta aaaggatgaa a 21 

<210> 71 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 71 

ctaacactaa aataaaaata aa 22 

<210> 72 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc__f eature 
<222> (0) . . . (0) 
<223> y = C or T 



WO 01/019845 PCT/US00/25479 

25 

<400> 72 

gtaggatgtt ataygaagag 2 0 

<210> 73 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> mis cofeature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 73 

aaacrctaac raacatacta c 21 

<210> 74 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 74 

gggttttttt tagggtattt 2 0 

<210> 75 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 75 

gaattaaatt tcaaaaaaac cr 22 

<210> 76 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 76 

tttaggagga tgyggagtt 19 



<210> 77 



WO 01/019845 



PCT/US00/25479 



26 

<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_feature 
<222> (0) ... (0) 
<223> r = G or A 

<400> 77 

aaaaaaccta acraaacact ta 22 

<210> 78 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 78 

gttattgtgt agtggagttt gg 22 

<210> 79 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_f eature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 79 

actccratta acaaaccaac 20 

<210> 80 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 

<221> misc_f eature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 80 

aatatggttt yggttggtaa 2 0 

<210> 81 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 81 

tccctaaatt ccacacatt 19 

<210> 82 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 82 

gtaagttgta gttggttgtt tta 23 

<210> 83 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Target sequence for bisulf ite-PCR primer 

<221> mis cofeature 
<222> (0) ... (0) 
<223> r = G or A 

<400> 83 

ctctctacta ccraattcct ct 22 

<210> 84 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 84 

gttttggttt tggttgtg 18 

<210> 85 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence for bisulf ite-PCR primer 
<400> 85 

ccactaccaa acaaatcccc 20 

<210> 86 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Target sequence for bisulf ite-PCR primer 
<400> 86 

tttattgggg aatttcggg 19 

<210> 87 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 

<221> mis cofeature 
<222> (0) . . . (0) 
<223> r - G or A 

<400> 87 

aacaaaataa ctactacrcc rtc 23 

<210> 88 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 
<400> 88 

gtaaagtgag gggtggtgat g 21 

<210> 89 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Target sequence 

<221> misc_feature 
<222> (0) ... (0) 
<223> r = G or A 

<400> 89 

ctccaaaaaa ctataaatac ccraa 25 

<210> 90 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 

<221> misc__f eature 
<222> (0) . . . (0) 
<223> y = C or T 



<400> 90 

gagtgagtga aggyggtaga tt 



22 



WO 01/019845 
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<210> 91 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 91 

aacctcacat taacrctcct aaa 23 

<210> 92 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 
<400> 92 

gtttttttaa gattgggttt ttttag 26 

<210> 93 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Target sequence 
<400> 93 

caaaccccaa acatccttta tcca 24 

<210> 94 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 
<400> 94 

ggatttaggg gtaaggggag gg 22 

<210> 95 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 

<221> mis cofeature 
<222> (0) . . . (0) 
<223> r = G or A 



WO 01/019845 



PCT/US00/25479 



<400> 95 

aaaaaccaca actaaaatcc ratt 

<210> 96 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 
<400> 96 

agtgagggat ttagttgtgg tgtg 

<210> 97 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 

<221> misc_f eature 
<222> (0) . . . (0) 
<223> r = G or A 

<400> 97 

aactatcrcc aacrccacaa 

<210> 98 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Target sequence 

<221> mi sc_f eature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 98 

aagagatttt tttttttttt tttygt 

<210> 99 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 

<221> misc_feature 
<222> (0) . . . (0) 
<223> r = G or A 



30 

24 



24 



20 



<400> 99 

aaaatccraa aaaaaacrcc ccc 



23 



WO 01/019845 
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<210> 100 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 

<221> misc_f eature 
<222> (0) . . . (0) 
<223> y = C or T 

<400> 100 

ggaagtttta ggggygtagg ggaa 

<210> 101 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 
<400> 101 

aacaaaatac aactcccaaa caccc 

<210> 102 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 
<400> 102 

ttagggtttg attttttaat ttggtt 

<210> 103 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Target sequence 

<221> misc_f eature 
<222> (0) ... (0) 
<223> x - G or A 

<400> 103 

caaaaaatta cratcccccc tc 

<210> 104 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Target sequence 



31 



24 



25 



26 
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<221> misc__f eature 
<222> (0) ... (0) 
<223> y = C or T 



<400> 104 

ttggaggtat aataaggaga tttygg 2 6 

<210> 105 
<211> 576 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> gene 

<222> (0) ... (0) 

<223> APOB CpG ISLAND 



<400> 105 

cccgggaggc gccctttgga ccttttgcaa tcctggcgct cttgcagcct gggcttccta 60 

taaatggggt gcgggcgccg gccgcgcatt cccaccggga cctgcggggc tgagtgccct 12 0 

tctcggttgc tgccgctgag gagcccgccc agccagccag ggccgcgagg ccgaggccag 18 0 

gccgcagccc aggagccgcc ccaccgcagc tggcgatgga cccgccgagg cccgcgctgc 240 

tggcgctgcc tgcgctgctg ctgctgctgc tggcgggcgc cagggccggt gagtgcgcgg 3 00 

ccgctctgcg ggcagcagag ggagcgggag ggagccggcg gaccgaggtt ggccggggca 3 60 

gcctgggcct aggccagagg gagggcagcc acagggtcca gggcgagtgg ggggattgga 42 0 

ccagctggcg gcccctgcag gctcaggatg gggggcgcgg gatggagggg ctgaggaggg 4 80 

ggtctccgga gcctgcctcc ctcctgaaag gtgaaacctg tgccggtggt ccccctgtcg 540 

ggcccctagc acccgctggg aagacgtggg aagctc 57 6 

<210> 106 

<211> 2093 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> gene 

<222> (0) ... (0) 

<223> CACNA1G CpG ISLAND 



<400> 106 

cctgcggccc tacgccagga ccccgcgccg aatactctga ttcttcgggc tccctccaag 

ggagtcccaa agaccccaat ggccaatagg aaagtgggtt cggtctgggc agcagtctga 12 0 

ttggctccag ccttcgggag cggacccagg ggcaagggga ggggagaggg gcggtcctgg 18 0 

gttttggggt gggaatcgga ttccagctgt ggttctctcc ctgcgctccc gcccgcactg 24 0 

ccacggcgga cggccaatgg gcgcgcggct cggggccggc ggcgtccggc gattggctgc 300 

ggggctgtct gggggcgggg ccgaggcttg aagttgaagt gagggatcca gctgtggtgt 3 60 

gcgcggggct cctcgccgcc gctttcgctc gctcgctccg cgtctcggcc ggaggaggag 42 0 

gctgtggcgc cggcgacagc tacggcagcg gcagccaccg cggcggctgc ggcggcggca 480 

tctccgcctc cactcccgcc cgggactgcc ccccactgtc tccccgcccc tcccggacag 54 0 

tgagcccgcg gcggggcggg ggaaggagcc gcccccaccc cctccaagcc cacccctaaa 600 

gagatccctc ctcccctccc ccgccgcctg gcgcggagcc gggacgatgc tgacccctta 660 

gatccggctc cagctgcgcc gcgggaagag ggggcgcccc tccccggacc cccgccctcc 72 0 

gccgctgccc cccttttcgt tcgccctctc ggggcggctt cgccgaaggt agcgccgaat 780 

ccggcaaccg gagcctgggc gcgaagcgaa gaagccggaa caaagtgagg gggagccggc 84 0 

cggctggccc gggaagcccc aggggcgcag gggaagcggg actcgcgccg ggcggggttt 90 0 

ccctgcgccc cggcgccccg cgggcagcat gcccctgcgg gcagggggag ctgggctgaa 960 

ctggccctcc cgggggctca gcttgcgccc tagagcccac cagatgtgcc cccgcegggg 102 0 

cccccgggtt gcgtgaggac acctcctctg aggggcgccg cttgcccctc tccggatcgc 1080 

ccggggcccc ggctggccag aggatggacg aggaggagga tggagcgggc gccgaggagt 1140 

cgggacagcc ccggagcttc atgcggctca acgacctgtc gggggccggg ggcggccggg 12 0 0 



60 
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gccggggtca gcagaaaagg acccgggcag cgcggactcc gaggcggagg ggctgccgta 12 6 0 

cccggcgctg gccccggtgg ttttcttcta cttgagccag gacagccgcc cgcggagctg 1320 

gtgtctccgc acggtctgta acccatatcc ttcggggcac gacggccagg cgcggggtca 13 8 0 

gaagggggac gggccgcacc gccgggggtc gggggggaag aagacccacc gccaggtgag 144 0 

tcgaagtgag cccggagggt aggcggatgg ggggggggct gccagggagg ggagggggca 150 0 

ccagagtggg agcggagacg cgagcaggtc tcgtcggtaa cccgggctta ccccacctgc 1560 

gtacacacac ctcagtcttc ctgggttggg ggggtgggga tccaggccag gagaagagag 162 0 

ctgtgccccg ctggctcgca gctggacgcc ctccagatgt ggtcagggga gggtcgtcat 168 0 

cctccagatg tgggaagctt cgggagcctg ggagctgtac tctgcccgcg ccggttagcg 1740 

agctgggttt ggtttccgag tttggtgggg ggtgggtggg ggcggtgggg aggaagctgc 1800 

ggggacggag gagggggga-c cgcaatctcc tgggtttccc tccttccccc gccccaaagt 1860 

ttgcggcgga ttctagatgt tggggggcgg ggaccaggtc ctggcccacc tcacccccca 192 0 

cctcgcgggt tggaggcaca acaaggagat tccggcggcg gctgatgtca ggggcgcaga 19 8 0 

atgagaacaa gatgtggtgg aggggagctg tctgcccccg gagctgggag tggagcccct 2 04 0 

ttccgctaga gcccagtgcc gcgggtgcct cctacccgat ctccattcga tgc 2093 



<210> 107 

<211> 327 

<212> DNA 

<213> Homo sapiens 

<220> 

<2 21> gene 

<222> (0) ... (0) 

<223> CDX2 CpG Island 



<400> 107 

ctcgttaatc acggaagccg ccggcctggg gctccgcacg ccagcctgtg cgggtcttcc 60 

ccgcctctgc agcctagtgg gaaggaggtg ggaggaaaga aggaagaaag ggagggaggg 12 0 

aggaggcagg ccagagggag ggaccgcctc ggaggcagaa gagccgcgag gagccagcgg 18 0 

agcaccgcgg gctggggcgc agccacccgc cgctcctcga gtcccctcgc ccctttccct 24 0 

tcgtgccccc cggcagcctc cagcgtcggt ccccaggcag catggtgagg tctgctcccg 300 

gtccctcgcc accatgtacg tgagcta 327 



<210> 108 

<211> 1663 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> gene 

<222> (0) ... (0) 

<223> EGFR CpG Island 



<400> 108 

gtccgcgggg accgggtcca gaggggcagt gctgggaacg cccctctcgg aaattaactc 6 0 

ctcagggcac cgctcccctc ccatgcgccg ccccactccc gccggagact aggtcecgcg 12 0 

ggggccaccg tgtccaccgc ctcgcggccg ctggccttgg gtccccgctg ctggttctcc 180 

tccctcctcc tcgcattctc ctcctcctct gctcctcccg atccctcctc cgccgcctgg 240 

tccctcctcc tcccgccctg cctcccgcgc ctcggcccgc gcgagctaga cgtccgggca 300 

gcccccggcg cagcgcggcc gcagcagcct cctccccccg cacggtgtga gcgcccgccg 360 

cgccgaggcg gccggagtcc cgagctagcc ccgcggccgc cgccgcccag accggacgac 42 0 

aggccacctc gtcgcgtccg cccgagtccc cgcctcgccg ccaacgccac aaccaccgcg 480 

cacggccccc tgactccgtc cagtattgat cgggagagcc ggagcgagct cttcggggag 540 

cagcgatgcg accctccggg acggccgggg cagcgctcct ggcgctgctg gctgcgctct 600 

gcccggcgag tcgggctctg gaggaaaaga aaggtaaggg cgtgtctcgc ggctccccgc 660 

cgcccccgga tcgcgccccg gaccccgcag cccgcccaac cgcaccgcgc accggcttcg 720 

cccgcgcccc cgcccgtcct ttcctgtttc cttgagatca cgtgcgccgc cgaccgggac 780 

cgcgggagga acgggacgtt tcgttcttcg gccgggagag tctggggcgg gcggaggagg 840 

agacgcgtgg gacaccgggc tgcaggccag gcggggaacg gccgccggga cctccggcgc 90 0 
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cccgaaccgc tcccaacttt cttccctcac tttccccgcc cagctgcgca ggatcggcgt 960 

cagtgggcga aagccgggtg ctggtgggcg cctggggccg gggtcccgca cgggctcccc 102 0 

gcgctgtctt cccagggcgc gacggggtcc tggcgcgcac ccgagggccg ctgcccaccc 1080 

gccgagactg cctgtttagg gaagctgagg aaggaaccca aaaatacagc ctccgctcgg 114 0 

accccgcggg acaggcggct ttctgagagg acctccccgc ctccgcgctc cgcgcaggtc 12 0 0 

tcaaactgaa gccggcgccc gccagcctgg ccccggcccc tctccaggtc cccgcgatcc 1260 

tcgttcccca gtgtggagtc gcagcctcga cctgggagct gggagaactc gtctaccacc 1320 

acctgcggct cccggggagg ggtggtgctg gcggcggtta gtttcctcgt tggcaaaagg 13 8 0 

caggtggggt ccgacccgcc ccttgggcgc agaccccggc cgctcgcctc gcccggtgcg 144 0 

ccctcgtctt gcctatccaa gagtgccccc cactcccggg accccagctc cctccgcgcc 1500 

cgcgccgaaa gccccaggct ctccttcgat ggccgcctcg cggagacgtc cgggtctgct 1560 

ccacctgcag cccttcggtc gcgcctgggc ttcgcggtgg agcgggacgc ggctgtccgg 162 0 

ccactgcagg gggggatcgc gggactcttg agcggaagcc ccg 1663 



<210> 109 

<211> 1787 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> gene 

<222> (0) . . . (0) 

<223> FBN1 CpG Island 

<221> misc_f eature 
<222> (1) . . . (1787) 
<223> n = A,T,C or G 



<400> 109 

agagccgcgt ctggagtggg ctctcgacac ccagggcaag tgggggcggc agagccctct 6 0 

cctcggtcgg cacagcagcc tctgccgcgg tcccggcctg cgacgcgccc agtcttagcc 120 

tcccggcctc cggcgtctgc tgagtgtccg gcgggagagg cgcagggagc gcgctaccgg 18 0 

gaggcgcggg cagcggggac tggttttctc tcgggccagg gcctccgggg caaccgtctc 240 

cagcgcgcat tcttggtgca ggtggaacag ctttctgctc cggtagggct tcacctatcg 3 00 

cgggagaggt taatctcgga tctaaacctc gcagccgcag agcgggctaa aaccgctact 3 60 

ccacctcttc ccatttctcc cctccccacc tcaagacaaa aagtcccagg ccgggcagga 420 

cctgatcacc tctgcctcct cccactgcgc taatcctgcg agcgagaggc cccgcaccga 480 

ggcggaggct gcaaagggga gtggaaaggg aggatggatg gggccggggg gtggggtggt 54 0 

gatgagggcg acgaaggagg gggtgtcatt ttctttttct ttcttttttt aaaaaaagta 60 0 

tttctctcgc gagaaaccgc tgcgcggacg atacttgaag aggtggggaa aggagggggc 66 0 

tgcgggagcc gcggcagaga ctgtgggtgc cacaagcgga caggagccac agctgggaca 72 0 

gctgcgagcg gagccgagca gtggctgtag cggccacgac tgggagcagc cgccgccgcc 78 0 

tcctcgggag tcggagccgc cgcttctcca gtgggtgcag ccggggtccg acgggggtcg 84 0 

ggcggccacc ggggctggag ctgcggccac ggaggctttt gcgtttgcgc cgnnngaggg 900 

cagggacagg gactggggtg aggggctgtc ccggaacgtc caacgtggnc gctggaccct 960 

cccctgcctg acagcttcct gnccggggct tcttggtgcc ggnccggcgt cagatgttcg 1020 

99999°99^9 catcgcccgg agtcggcggg gacggcgcgg ctgcttccag ctggcggaga 10 8 0 

gggcaggctg aggagtgggg cgttcagagc gcgcatcgcg cgcaattcgt gccgctaaaa 1140 

aaaataaacc cagagagctc gcccggggct taggaccgct ggggatatgg gtactttgcg 12 0 0 

ccgcgctctt ctggcggggc ccgggaggcg agggattggc cggggctgct gcgccggggc 12 60 

ctgggctttc cagccagctg tggaccaaac ggtcttccct tacccaaatt aactgcgcca 1320 

caggcggccg acnggttggg ctttgggaat ggggaccgcg agcttcagca tcccgatgcc 13 8 0 

ctgaaagtct ccccgcctcg gggatttgtc tctgtgttgc agctggcagg ggccgcctga 144 0 

agtgggagca gcgcctggag aaggcgggag gagcccggcc cgggggacgg gcggcgggat 15 0 0 

agcgggaccc cggcggcgcg gtgcgcttca gggcgcagcg gcggccgcag accgagcccc 1560 

gggcgcggca agaggcggcg ggagccggtg gcggctcggc atcatgcgtc gagggcgtct 1620 

gctggagatc gccctgggat ttaccgtgct tttagcgtcc tacacgagcc atggggcgga 1680 

cgccaatttg gaggctggga acgtgaagga aaccagagcc agtcgggcca agagaagagg 174 0 

cggtggagga cacgacgcgc ttaaagggta aaggaaccgg ttccctc 1787 
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<210> 110 
<211> 810 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> gene 

<222> (0) . . . (0) 

<223> GPR37 CpG Island 



<400> 110 

tcccgccccg cacccgcccc tagcccgggc tcggggacct gtcaggctgg tttcgacagc 60 

tggggaatta acctgtcccg cccatcccta gcctcgagcc gcgcaggctc cgcgcctccg 12 0 

cccttgttcc ctcccagctc ctccgagtgg aagccgctac aaatggcttg aatgaaacgt 18 0 

gtgtgggttt agtgagtggt gaaccaccag gggatcccgt ctccccacaa accagtatct 24 0 

ctccgaggag gaggcgaagg agtgggagga ggcaacgagc cgagagtcga gcttcgcggg 3 00 

cgcgcgcagc ggctggagcg cgggggcgag gccgggccac ctccccttcc cggccgcgca 3 60 

ctgcctggcc cgcggcggtt ccaggcacca cccttcccgt ccgggctgag cccgctgtgg 42 0 

cagtgactag ctcccgcggc tagcggcact gtccaccgac gagcggcgcc ctcttctccc 4 80 

ccttctcccc acgatttcct tctctgcggc ggcacgccgt ccagcagcct gcttcgcccc 54 0 

gtcgtcaact ttgagctgga ggagaagcaa ctttggcagt ggccgcgggg ttggaatccc 600 

gcttctcctc ggcagcagta ggctcgcaag tcgctggggt taggtggggc aagagtttcg 660 

ccggcgcatc agcgctgctt cggactgttt gcaacgtgtt tccagcgagc tgggagcggg 72 0 

gttgtgactg cgagtcgtct gggggagggg gacttgtttt tcttttcctc tagagacctc 78 0 

ggcttgcaac tggatcaaac gctgtcgaaa 810 



<210> 111 
<211> 550 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> gene 

<222> (0) . . . (0) 

<223> HSPA6 CpG Island 



<400> 111 

tgtattcgca tggtaacata tcttcggtct tcctgccgct gggctctcag cggccctcca 60 

aggcagcccg caggcccgtg ctcgcctcag ggatcctcca cagccccggg gagaccttgc 12 0 

ctctaaagtt gctgcttttg cagctctgcc acaaccgcgc gtcctcagag ccagccggga 180 

ggagctagaa ccttccccgc gtttctttca gcagccctga gtcagaggcg ggctggcctt 24 0 

gcaagtagcc gcccagcctt cttcggtctc acggaccgat ccgcccgaac cttctcccgg 3 00 

ggtcagcgcc gcgctgcgcc gcccggctga ctcagcccgg gcgggcgggc gggaggctct 3 60 

cgactgggcg ggaaggtgcg ggaaggttcg cggcggcggg gtcggggagg tgcaaaagga 42 0 

tgaaaagccc gtggacggag ctgagcagat ccggccgggc tggcggcaga gaaaccgcag 4 80 

ggagagcctc actgctgagc gcccctcgac gcgggcggca gcagcctccg tggcctccag 54 0 

catccgacaa 550 



<210> 112 
<211> 278 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> gene 

<222> (0) . . . (0) 

<223> IQGAP2 CpG Island 

<400> 112 

agagttcact tttacttcag tgtcagcgcg cggcggccgt ggctggctct ggcgagagag 60 
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caccgaggga 
ccgagcgcgc 
gcgcgccccg 
gcagagaccc 



gtgggtcgca 
cggcggggcg 
ggcgggcccc 
cgctatggct 



gatcttcggg 
cagagcccgc 
cggagacgcg 
ctattgtgga 



cggctagggg aaatcggcga gaggcgggat 
gagcctggcc agcgagggta gccgcggggg 
caggatgcca cacgaagagc tgccgtcgct 
cgatgaaa 



120 
180 
240 
278 



<210> 113 

<211> 1461 

<212> DNA 

<213> Homo sapiens 

<220> 

<2 21> gene 

<222> (0) ... (0) 

<22 3> KXi CpG Island 

<400> 113 

ctcgaaagag gggcgcgggt gggcgcgtct ccccgcgagc atctcaccta agggggaatc 6 0 

cctttcagcg cacggcgaag ttccccctcg gctgtcccac ctggcagtcc ctctaggatt 12 0 

tcggccagtc cctaattggc tccagcaatg tccagccgga gcttctttgg gcctccgagt 180 

gggagaaaag tgagagcagg tgcttcccca gcggcgcgct ccgctagggc ccggcaggat 24 0 

cccgccccca agtcgggaaa agttggtcgg cgccttttct ccccgacgaa gccgctccag 3 00 

ggctgctctc agaggacgcg cggcaggcaa agagaatgaa cctgagcgtc cacgaaacgt 3 60 

cctgcacggc tcccgggagc tgggagaaac aggtgccttt ctccgacgtc cgcgggcgac 42 0 

gcctgccgca ccttgcccgc tgccgcgccc ctcccgggca cccctcgccc tcggcgcccc 480 

tgcccccacc cccagtgcca gggcggaggc agtcccggct cgcaggtaat tattgccagc 54 0 

ggagcccgcc ggggagcggg ggtgggcgcg ccggcggtgg gcgggcgggc gcggcggggc 60 0 

gcgggcataa aggggcgcgg cgcggggccc cggagcctgg ctcccg'cgca gcatgcccgc 66 0 

cagcgccccg ccgcgccgcc cgcggccgcc gccgccgtcg ctgtcgctgc tgctggtgct 72 0 

gctgggcctg ggcggccgcc gcctgcgtgc ggagccgggc gacggcgcgc agacctgggc 78 0 

ccgtttctcg cggcctcctg cccccgaggc cgcgggcctc ttccagggca ccttccccga 84 0 

cggcttcctc tgggccgtgg gcagcgccgc ctaccagacc gagggcggct ggcagcagca 90 0 

cggcaagggt gcgtccatct gggacacgtt cacccaccac cccctggcac ccccgggaga 960 

ctcccggaac gccagtctgc cgttgggcgc cccgtcgccg ctgcagcccg ccaccgggga 102 0 

cgtagccagc gacagctaca acaacgtctt ccgcgacacg gaggcgctgc gcgagctcgg 108 0 

ggtcactcac taccgcttct ccatctcgtg ggcgcgagtg ctccccaatg gcagcgcggg 114 0 

cgtccccaac cgcgaggggc tgcgctacta ccggcgcctg ctggagcggc tgcgggagct 120 0 

gggcgtgcag cccgtggtca ccctgtacca ctgggacctg ccccagcgcc tgcaggacgc 126 0 

ctacggcggc tgggccaacc gcgccctggc cgaccacttc agggattacg cggagctctg 132 0 

cttccgccac ttcggcggtc aggtcaagta ctggatcacc atcgacaacc cctacgtggt 13 8 0 

ggcctggcac ggctacgcca ccgggcgcct ggcccccggc atccggggca gcccgcggct 144 0 

cgggtacctg gtggcgcaca a 14 61 

<210> 114 

<211> 249 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> gene 

<222> (0) ... (0) 

<223> PAR2 CpG Island 

<400> 114 

cccggggcgt ggcctcccgc aggtgagtac gctgctcctt cggtttccct gaaacctaac 6 0 

ccgccctggg gaggcgcgca gcagaggctc cgattcgggg caggtgagag gctgactttc 120 

tctcggtgcg tccagtggag ctctgagttt cgaatcggcg gcggcggatt ccccgcgcgc 18 0 

ccggcgtcgg ggcttccagg aggatgcgga gccccagcgc ggcgtggctg ctgggggccg 24 0 

ccatcctgc 249 



<210> 115 
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<211> 709 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> gene 

<222> (0) ... (0) 

<223> PITX2 CpG Island 



<400> 115 

agtccgtgct 

gctctcccgg 

tcgtttctcg 

tggatcgcca 

tttggtggaa 

agagagggat 

aggagcagaa 

ctccttcgga 

gcttgccgtg 

ggagacagca 

ctggtgtcgg 

gtgggcgccg 



cctgctcctc 
ctccggctcc 
tcgcccctgc 
gccgcgcagc 
tctctgctga 
gagagggagg 
agaaactgcc 
acttggcacc 
cagccggagg 
gctctctccc 
cgtgtctgca 
aggacccgtc 



ggttggctcc 
cgactcttcg 
tcgctccccc 
cgggctcggc 
cgtcacgtca 
gagaggagag 
agtggcggct 
ctcaggagcc 
ctcggctcgc 
ggtagccgat 
attagagaaa 
taagaagaag 



taagtgcccc 
gcccgctggc 
cggcgctcgc 
cggccgcccg 
ctccccacac 
agagtgcgag 
agatttcgga 
ctgcagtcct 
tggaaatcgc 
aacggggaaa 
gataaaagcc 
cggcaaaggc 



gccaggtccc 
atctgcttcc 
ccgggcgctg 
cgcgccactg 
ggagtaggag 
accgagcgag 
ggccccagtg 
ctcaggcccg 
cccgggaagc 
tggagaccaa 
agcaggggaa 
ggcagcgga 



ctctcctttc 
ctcccctgcc 
tgctcgctcc 
tgcagtggag 
cagagggaag 
aaagctggag 
cacccgtgga 
gctttcgggc 
agtgggacgc 
ctgccgcaaa 
gaatgaggac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
709 



<210> 116 

<211> 1496 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> gene 

<222> (0) ... (0) 

<223> PTCA CpG Island 

<22 1 > misc^feature 
<222> (1) . . . (1496) 
<223> n = A, T, C or G 



<400> 116 

gcggccgcag cggcagcagc gcccgccgtg tgagcagcag cagcggctgg tctgtcaacc 6 0 

ggagcccgag cccgagcagc ctgcggccag cagcgtcctc gcaagccgag cgcccaggcg 12 0 

cgccaggagc ccgcagcagc ggcagcagcg cgccgggccg cccgggaagc ctccgtcccc 180 

gcggcggcgg cggcggcggc ggcaacatgg cctcggctgg taacgccgcc gagccccagg 24 0 

accgcggcgg cggcggcagc ggctgtatcg gtgccccggg acggccggct ggaggcggga 3 00 

ggcgcagacg gacggggggg ctgcgccgtg ctgccgcgcc ggaccgggac tatctgcacc 3 60 

ggcccagcta ctgcgacgcc gccttcgntn nggagnagat ttnccanggn nggcatttca 42 0 

gactntntcn ttcccacttt ntcttcccnt acctntaact cntnggggat cgcccccgcc 480 

acacacaaac acacacactn tcttcctctn tntctcacac acaacacaca cactcactca 54 0 

cacntctnca ggaaaagcag cagacaaatg gggattgaaa aattcaaacc ctccctctgg 600 

tnntgggagg aaagggctgt ctgaggtccg cagggggtgg aggtgtgtgt gtgtgcgtgt 660 

gtgtgtgtgn anacacacgc cctccctggt gtgccttttc cggagcactg gaaagccgtc 72 0 

cacggcggac cacctcaagg gcggccgcgc ggtcgtagcg gtagtagcgt tcgtcgtgtg 780 

agtagtagta gcggttggtt tgttaatcgg agttcgagtfc cgagtagttt gcggttagta 84 0 

gcgttttcgt aagtcgagcg tttaggcgcg ttaggagttc gtagtagcgg tagtagcgcg 90 0 

tcgggtcgtt cgggaagttt tcgttttcgc ggcggcggcg gcggcggcgg taatatggtt 96 0 

tcggttggta acgtcgtcga gttttaggat cgcggcggcg gcggtagcgg ttgtatcggt 1020 

gtttcgggac ggtcggttgg aggcgggagg cgtagacgga cgggggggtt gcgtcgtgtt 10 8 0 

gtcgcgtcgg atcgggatta tttgtatcgg tttagttatt gcgacgtcgt tttcgntnng 1140 

gagnagattt nttanggnng gtattttaga ttntnttntt tttattttnt ttttttntat 12 0 0 

ttntaatttn tnggggatcg ttttcgttat atataaatat atatattntt tttttttntn 1260 

ttttatatat aatatatata tttatttata tntttntagg aaaagtagta gataaatggg 1320 
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gattgaaaaa tttaaatttt ttttttggtn ntgggaggaa agggttgttt gaggttcgta 13 8 0 

99gggtggag gtgtgtgtgt gtgcgtgtgt gtgtgtgnan atatacgttt tttttggtgt 144 0 

gtttttttcg gagtattgga aagtcgttta cggcggatta ttttaagggc ggtcgt 1496 

<210> 117 
<211> 701 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> gene 

<222> (0) ... (0) 

<223> PTCHB CpG Island 

<221> misc_feature 
<222> (1) . . . (701) 
<223> n = A, T, C or G 



<400> 117 

gcggccgcgg cactgtcctg ccccgtgccc cctgccctga acttcttcct cctgcgcccc 60 

tgcccctatt tgcagcctaa actcctgtac ggctgccaca tttcttaaca tcttggaggg 120 

ggaggcggag tggagagagg cggagagagg aaggggggag ggagccgaaa taaaggtggt 18 0 

ttcctttttt ggcagccagt tttggttttg ttgagcatga aatctctgct cccttaaaaa 240 

attattctcg gaaaaagata tcccccccgt tttccaggtt ttgagccgcc tctccttagg 300 

gcctggtcgg gggaggaaaa gttgtaaaca aattgccacc ttaaattcgc ggtgcgantc 360 

tgcggagctg ccgggttcat tgtgtttacg aggctcgctg aaatgtgtgg aatccaggga 420 

aggcgagcac ccagacgggg gcccgccggg gccgcggcca gcgccgggga aatgccgcgc 480 

cggggagcag catgcgccgg cctgagccct tccctttgca ctcggctgtt ttttacgttt 540 

aaccagaaag gaagggagag gagggaaaga tccatgtggc tgccctcttc cgatcacaaa 600 

tattgtcgta agttgcagct ggctgcccca nttcctaatt cagctcacac agcntntccc 660 

cacgctatgg aaatgcgtcg ggagtgaact ccggcggccg c 701 



<210> 118 

<211> 273 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> gene 

<222> (0) . . . (0) 

<223> SDC1 CpG Island 



<400> 118 

ggagaggtgc gggccgaatc cgagccgagc gagaggaatc cggcagtaga gagcggactc 60 

cagccggcgg accctgcagc cctcgcctgg gacagcggcg cgctgggcag gcgcccaaga 12 0 

gagcatcgag cagcggaacc cgcgaagccg gcccgcagcc gcgacccgcg cagcctgccg 180 

ctctcccgcc gccggtccgg gcagcatgag gcgcgcggcg ctctggctct ggctgtgcgc 240 

gctggcgctg agcctgcagc tggccctgcc gca 2 73 

<210> 119 
<211> 751 
<212> DNA 

<213 > Homo sapiens 



<220> 

<221> gene 

<222> (0) . . . (0) 

<223> SDC4 CpG Island 



<400> 119 
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agtaggagcc ggcgggctcg ggcagggcgg gtcccttggg gtttccaact ccgcgggcgg 6 0 

gcgcagtgcc ccgcaggcct cgcttccact ggggaattcc gggcggggtg cgggcggcgg 12 0 

99^99999^9 ggccggggcg gggccggtag gccgcctata agatgggtgg cgcgcccgcc 180 

cggggccact cgccgcagcc tgcgcgcctt ctccagtccg cggtgccatg gcccccgccc 240 

gtctgttcgc gctgctgctg ttcttcgtag gcggagtcgc cgagtcggtg ggtgcttgga 3 00 

ggttcccggg ctgggggcga agcgggggcg caggccggtg cctcctttgt tcgtcggagc 3 60 

gtgggatggg ggggggcaga tcgggggtac gctaccccca accggacacc gaggcccggg 42 0 

aaactttgtt ggaaactttg ctccggggtc acgggccagc ctccgggatg gcttcacgcg 480 

ccgtgcgccc ctcgcctgtt gctcttcccg cctccccggg cctcagcccc gccgcgggct 540 

acgggctcgt tagtgactaa gccggtgtca actcttcaac tcccacaccc tcgtcccttc 600 

cctggtgacc ctggggcagg cttggagcgc tgaatcccct cctcgctctc ggggcgccca 66 0 

gagcagacag ctttaggatc cgagatggcc ctgggggtcg gggggctgcg tgtactcgga 72 0 

agggggaggg ttttagggtt gtgcgaggcc c 751 



<210> 120 

<211> 673 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> DNA fragment termed MINT31 

<221> misc__f eature 

<222> (1) . . . (673) 

<223> n = A, T, C or G 



<400> 120 

cccggggcct ctatcctggc gggaagggca ggccgacccg gcagactgcg gcctctcggg 6 0 

agggaagaag gtgtcagacg cgcggagcaa ccataaatag cccccctttc ccagaagacg 12 0 

gcacggggtt caagactcag gcgccgcata ctcagaatga gagcagagac tcccgccagg 18 0 

aaaaaagggc acttagggga tctgctcatt aacatgaaat gcaaatgagc ccgcccggcc 24 0 

tcatttacac aactctgtgc atggattcgg cgaaagggca accagggaga cgacggcgca 3 00 

gcagccactc tgccacttcc cccatcccct cccccccatc ggccggggcg ggaactgaga 3 60 

cgaccccaac cctctgcggc ggcgggaggt gcgcgggggc tgcgtgggtg gtgcagcctt 42 0 

aggggagtga acaacgccca ggggtgatgg cctcagcaaa gtgaggggtg gtgatggagg 48 0 

tcatccgacc catcccgccg cctctccgca gtggcgcaag cgccccaaaa tctccggaga 540 

nggaactgag tgacccacta ggttccgccg tgtctacctc tcgcagatgt tggggaagtg 600 

cttcccggcg tctaatcctc gctgttcccc cctccaccgg cgcccagcac acccgcggcg 660 

ctccgctccc ggg 673 
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